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ABSTRACT 


> Context. In November 2019, eROSITA on board of SRG observatory started to map the entire sky in X-rays. After the 4-year survey program, it 
— will reach flux limits about 25 times deeper than ROSAT. During the SRG Performance Verification phase, eROSITA observed a contiguous 140 
= deg? area of the sky down to the final depth of the eROSITA all-sky survey ("eROSITA Final Equatorial-Depth Survey": eFEDS), with the goal of 
obtaining a census of the X-ray emitting populations (stars, compact objects, galaxies, clusters of galaxies, AGN) that will be discovered over the 
m= entire sky. 
Aims. This paper presents the identification of the counterparts to the point-sources detected in eFEDS in the Main and Hard samples described in 
T Brunner et al. (2021), and their multi-wavelength properties, including redshift. 

1 Methods. For the identification of the counterparts we combined the results from two independent methods (Nway and AsTROMATCH), trained on 
the multi-wavelength properties of a sample of 23k XMM-Newton sources detected in the DESI Legacy Imaging Survey DR8. Then spectroscopic 
redshifts and photometry from ancillary surveys are collated for the computation of photometric redshifts. 

1 Results. The eFEDS sources with a reliable counterparts are 24774/27369 (90.5%) in the Main sample and 231/246 (93.9%) in the Hard sample, 
including 2514 (3) sources for which a second counterpart is equally likely. By means of reliable spectra, Gaia parallaxes, and/or multi-wavelength 
= properties we have classified the reliable counterparts in both samples as ’Galactic’ (2695) and ’extragalactic’ (22079). For about 340 of the 
U^ extragalactic sources we cannot rule out the possibility that they are unresolved clusters or belong to clusters. Inspection of the distributions of the 
X-ray sources in various optical/IR colour-magnitude spaces reveal a rich variety of diverse classes of objects. The photometric redshifts are most 
reliable within the KiDS/VIKING area, where also deep near-infrared data are available. 
Conclusions. This paper is accompanying the eROSITA early data release of all the observations performed during the performance and verifi- 
cation phase. Together with the catalogs of primary and secondary counterparts to the Main and Hard samples of the eFEDS survey this paper 


releases their multi-wavelength properties and redshifts. 


© 
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© Across the electromagnetic spectrum, sensitive wide-area sur- tating stars are all phenomena accessible by X-ray sensitive in- 
= veys serve multiple purposes. First and foremost, they help as- struments. 
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doing so they reveal the inner workings of the Milky Way, the 
local group, and the filamentary large scale structure underpin- 
ning the distribution of matter. Secondly, by observing and cat- 
aloguing large numbers of stars, galaxies, groups, clusters and 
super-clusters of galaxies that are the main visible tracers of this 
large-scale structure, wide area surveys also provide new statis- 
tical tools for the study of classes and populations of astronom- 
ical objects, thus helping astronomers to better understand their 
life-cycles, their interactions and, ultimately, their physical prop- 
erties. 

X-ray surveys, in particular, reveal fundamental physical 
processes invisible at other wavelengths. The hot, diffuse plasma 
that virialises and thermalises within massive dark matter knots; 
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Telescope Array; Predehl et al. 2021), onboard the Spektrum- 
RÓntgen-Gamma (SRG) mission (Sunyaev et al. 2021), was de- 
signed to provide sensitive X-ray imaging and spectroscopy over 
a large field of view, thus unlocking unprecedented capabilities 
for surveying large areas of the sky to deep flux levels. More- 
over, the SRG mission plan includes a long (4 years), uninter- 
rupted all-sky survey program (the eROSITA All-Sky Survey: 
eRASS; Predehl et al. 2021) capable of detecting, for the first 
time, millions of X-ray sources. 

In order to demonstrate these ground-breaking survey ca- 
pabilities, and prepare for the science exploitation of the up- 
coming all-sky survey, the contiguous 140 square degrees of 
the eROSITA Final Equatorial-Depth survey (eFEDS; Brunner 
et al. 2021) was observed during the SRG Calibration and Perfor- 
mance Verification phase, between the 3! and the 7 of Novem- 
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Fig. 1: eFEDS X-ray and multi-wavelength coverage. The thick blue line shows the outer bound of the region that was searched for 
X-ray sources. The thin blue beaded line shows the region having at least 500 seconds of effective X-ray exposure depth. We indicate 
the approximate coverage of several selected surveys that are particularly important for this work; Subaru HSC-Wide (green shaded 
region), KiDS/VIKING (magenta dashed box), GAMA09-DR3 (red hatched box). The eFEDS field is also immersed in several other 
important surveys that completely (or almost-completely) enclose the displayed region: e.g. Galex all-sky surveys (in UV), Gaia (in 
optical), Legacy Survey DR8 (optical combined with Gaia and WISE), VHS and UKIDSS (in Near-Infrared), WISE/NEOWISE-R, 


and SDSS (optical imaging and spectroscopy). 


ber 2019. The entire field, centreed at R.A. 136 and Dec. +2 
(see Figure 1), was observed to an approximate depth of ~2.2 ks 
(~1.2 ks after correcting for telescope vignetting), correspond- 
ing to a limiting flux of Fos.2xv ~ 6.5 x 10° ergs”! cm”. 
The eFEDS field was chosen from among the extragalactic areas 
with the richest multi-wavelength coverage visible by eROSITA 
in November 2019. The observations are just about 5096 deeper 
than anticipated for eRASS:8 at the end of the planned 4-year’ 
program in the ecliptic equatorial region (~ 1.1 x 10714 erg 
cm ? s~!, Predehl et al. 2021). Said otherwise, the eFEDS expo- 
sure corresponds roughly to the 80th percentile of the expected 
eRASS:8 exposure distribution over the whole sky and, as such, 
eFEDS is a fair representation of what the final eROSITA all- 
sky survey will be, enabling scientists to face and solve the chal- 
lenges that will accompany their work for the duration of the 
survey. 

As discussed in detail in Brunner et al. (2021), the X-ray 
catalogs generated by the analysis of the eFEDS eROSITA data 
comprise a Main one, with 27910 sources detected above a de- 
tection likelihood of 6 in the most sensitive 0.2-2.3 keV band, 
and a Hard one, containing 246 sources detected above a detec- 
tion likelihood of 10 in the less-sensitive 2.3-5 keV band. 

In this paper, we focus our attention on the point-like (i.e. 
with an extension likelihood EXT. LIKE-0!) X-ray sources con- 
tained in these catalogs (27369 and 246 for the Main and 
Hard sample, respectively), and describe in detail the proce- 
dure to (1) identify reliably multi-wavelength counterparts to the 


! this parameter is obtained from the task srctool of the eSASS soft- 
ware; (Brunner et al. 2021) 
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eROSITA sources, (ii) classify and characterise their properties 
and (iii) provide reliable redshift measurements (spectroscopic 
when available and photometric otherwise). The identification 
and determination of the reliability of the counterparts, the com- 
putation of the photometric redshifts (photo-z), and the charac- 
terisation of the sample follow the same procedure for both Main 
and Hard samples, and for simplicity we discuss here specif- 
ically only the Main sample, given the large overlap between 
the two catalogs (226/246 hard sources are in common). While 
we provide here the catalog of counterparts for all the sources 
in both samples, the properties of the sources in the Hard sam- 
ple are presented and discussed in Nandra et al. (in prep.). The 
papers on X-ray spectral analysis (Liu et al. 2021b), variabil- 
ity (Boller et al. 2021; Buchner et al. 2021), X-ray Luminosity 
Function (Buchner et al., in prep; Wolf et al, in prep) and host 
properties of eFEDS AGN (Li et al., in prep.) are all based on the 
catalog and/or the methodology presented in this work. Based on 
this work are also the papers presenting interesting single objects 
(Brusa et al. 2021; Toba et al. 2021; Wolf et al. 2021), the X-ray 
properties of WISE sources in eFEDS (Toba et al. 2021) , radio 
properties of unresolved clusters (Bulbul et al. 2021) and photo-z 
computed via machine learning (Nishizawa et al., in prep), 


The structure of the paper is as follows: In section 2 we sum- 
marise the availability of ancillary data that will be used for the 
identification of the X-ray counterparts and photo-z estimates. 
Sections 3 and 4 describe in detail the methods used in this paper 
for identifying the counterparts, while in section 5 the counter- 
parts are finally assigned. In section 6 the counterparts are sepa- 
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rated in Galactic and extragalactic, using morphological, photo- 
metric and proper motion information. 


Because of its size, the field is well populated by stars, 
AGN, clusters and nearby galaxies. Each eROSITA working 
group has developed independent methods for the identifi- 
cation of sources of interest, and in Sections 5.3 and 7.5 
a comparison is made with two main source classes: stellar 
coronal emitters and clusters of galaxies, with the ultimate goal 
of consolidating the counterparts and classifying them at the 
same time. Section 6 deals with the multiwavelength properties 
of the counterparts and the characterisation of their Galactic 
or extra-galactic nature. Section 7 presents and discusses the 
photo-z computed with Le PHARE (Ilbert et al. 2006; Arnouts 
et al. 1999), including a comparison with DNNz (Nishizawa et 
al., in prep), an independent method based on machine learning. 
Section 8 describes the released data. The basic properties of the 
point-source eFEDS population based on redshift, photometry, 
and X-ray flux are presented in Section 9. The conclusions in 
Section 10 close the paper, with a forecast of the results and 
challenges that we will face when working on the eROSITA 
all-sky survey. 


The description of the Catalogs that we release is provided in 
the Appendix, together with the list of templates used for com- 
puting the photo-z. 

Throughout the paper we assume AB magnitudes unless dif- 
ferently stated. In order to allow direct comparison with exist- 
ing works from the literature of X-ray surveys, we adopt a flat 
ACDM cosmology with h = Ho/[l00kms ! Mpc^!] = 0.7; 
5420.3; Qa =0.7. 


2. Supporting ancillary data 


For studies of X-ray sources (taken singularly or as a popula- 
tion) the entire spectral energy distribution (SED) needs to be 
constructed and the redshift determined. Only rarely can red- 
shift be obtained directly from X-ray spectra, and it is instead 
routinely obtained either via optical/Near-Infrared spectroscopy 
or via photometric techniques. However, for that to work the 
counterparts to the X-ray sources need to be determined first. 
Deep and homogeneous multi-wavelength data are therefore a 
pre-requisite for any complete population study of an X-ray sur- 
vey. 

The main challenge is that, in survey mode, eROSITA has a 
Half Energy Width (HEW) of 26 arcsec? (Predehl et al. 2021), 
which makes the identification of the correct counterparts not 
at all trivial (e.g., see the review of Naylor et al. 2013; Sal- 
vato et al. 2019), especially if considering in addition that wide 
field multi-wavelength homogeneous surveys are very difficult 
to obtain, with very few exceptions (Dey et al. 2019a, see the 
Legacy imaging survey supporting the Dark Energy Spectro- 
scopic Instrument, DESI;[). By construction, the eFEDS field 
is placed in an area fully encompassing the GAMAOO equato- 
rial field (Driver et al. 2009), and rich with ancillary photomet- 
ric and spectroscopic data (Merloni et al., in prep.). Below we 
list and describe in more detail the surveys that have been used 
in this work. Table 1 summarises the depth in each filter, while 
Figure 2 shows the coverage of the ancillary data in wavelength. 


? ie., comparable to the XMM Slew Survey: https: //www.cosmos. 
esa.int/web/xmm-newton/xmms12-ug 


2.1. Supporting the associations 


The identification of the counterparts is carried out using the 
DESI Legacy Imaging Survey DR8 (LS8; Dey et al. 2019b), for 
various reasons. First of all, it covers homogeneously the field 
and it has sufficient depth, based on the expectation of the X-ray 
population optical properties (Merloni et al. 2012). In addition, 
the survey provides together with Gaia also the AIIWISE trac- 
tor (Lang 2014) photometry extracted at the position of the op- 
tical sources. Finally, the survey covers 14,000 square degrees 
of sky thus providing a sufficient number of sources external to 
eFEDS that can be used as training and validation samples for 
testing the association (see Section 3). The absolute astrometry 
of the LS8 catalogue is registered to the Gaia DR2 astrometric 
frame, with residuals typically smaller than 30 milliarcseconds?. 
While the catalogued positional uncertainties of individual LS8 
sources are often much larger than this systematic limit (espe- 
cially toward fainter magnitudes), they are still extremely small 
relative to those of the eFEDS X-ray sources and are set to 0.1" 
for the entire survey. Photometry and parallax measures from 
Gaia, which are optimised for point-like sources, are ideal for 
the identification of the stars in our sample. For this purpose, 
rather than the Gaia DR2 provided by LS8, the EDR3^ has been 
used (Gaia Collaboration et al. 2020). 


2.2. Supporting Photometric redshifts 


For the computation of the photo-z the following data sets have 
been used: 


— GALEX: The NASA satellite GALEX has mapped the entire 
sky in Far and Near UV between 2003 and 2012, with a typ- 
ical depth of 19.9 and 20.8 AB magnitude in FUV and NUV, 
respectively. We have used the catalog GR6/7 presented in 
Bianchi (2014) and available via Vizier. 

— Kilo-degree Survey (KiDS)*: the survey mapped 1350 deg? 
in u,g,r,i bands using VST/OmegaCAM. The same area was 
also covered by the VISTA Kilo-Degree Infrared Galaxy 
Survey. (VIKING, Edge et al. 2013) in Z, Y. JL, H,K. We have 
used here the catalog presented in Kuijken et al. (2019); it 
has ZYJHK aperture-matched, forced photometry to the ugri 
source positions. About 65 deg? of sky are shared between 
KiDS/VIKING and eFEDS. 

— HSC 819A: The Hyper Suprime-Cam (HSC; Miyazaki et al. 
2018) Subaru Strategic Program survey (HSC-SSP; Aihara 
et al. 20182) is an ongoing optical imaging survey with five 
broadband filters (g-, r-, i-, z-, and y-band) and four narrow- 
band filters (see Aihara et al. 2018b). We utilised 519A wide 
data obtained from March 2014 to April 2019, which pro- 
vides forced photometry for the five bands, with the 5c lim- 
iting magnitudes as listed in Table 1 (see Aihara et al. 2018b, 
2019). The astrometric uncertainty is approximately 40 mil- 
liarcseconds in rms. 

— VISTA/VHS: The entire Southern hemisphere has been ob- 
served by VISTA in Near Infrared and at least for J and Ks 
the depth is thirty times the depth of 2MASS (McMahon 
et al. 2013). We have used the DR4 data available via Vizier. 

— WISE: The Wide-field Infrared Survey Explorer (WISE; 
Wright et al. 20102), over the course of one year scanned 
the entire sky in the 3.4, 4.6, 12 and 22 um bands (hereafter 


3 https: //www.legacysurvey.org/dr8/description/ 
#astrometry 

B https://www.cosmos.esa.int/web/gaia/earlydr3 
5 http://kids.strw.leidenuniv.nl/ 
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Fig. 2: Illustration of the band-pass and relative transmission curves of the UV/optical/NIR photometry used for computation of 
photo-z in this work. For clarity, we do not show the WISE band-passes. 


Bands Survey Depth(AB mag) Reference 
various indicators 
FUV, NUV GALEX 19.9, 20.8 Bianchi (2014) 
u,g,ri KiDS 24.2, 25.1, 25.0, 23.7 Kuijken et al. (2019) 
grizy HSC 26.8, 26.4, 26.4, 25.5, 24.7 Aihara et al. (2018b) 
gri LS8 24.0, 23.4, 22.5 Dey et al. (20192) 
z, J,H,K KiDS/VIKING  23.1,22.3,22.1,21.5, 21.2. Kuijken et al. (2019) 
J. Ks VISTA/VHS 21.1, 19.8 McMahon et al. (2013) 


WLW2,W3,WA LSS/WISE 


21.0, 20.1, 16.7, 14.5 


Meisner et al. (2019) 


Table 1: Photometry available for the counterparts identification and photo-z computation. For LS8 the required depth for DESI is 
listed. For Ls8/WISE the listed depth is taken from (Meisner et al. 2019) and it is computed using WISE detected sources. Given 
that the photometry used here is forced photometry at the position of optically detected sources, the depth is higher. 


W1, W2, W3, W4). Afterwards, the survey continued with 
observations only in W1 and W2. The photometry in W1, 
W2, W3, WA from LSS includes all five years of publicly 
available WISE and NEOWISE reactivation (Meisner et al. 
2019) and it is measured using the TRAcTOR algorithm (Lang 
2014) at the position of grz detected sources. 


2.3. Optical spectroscopy 


The eFEDS field has previously been observed by several 
spectroscopic surveys, most notably GAMA, SDSS, WiggleZ, 
2SLAQ, LAMOST. Many of the existing spectra are of high 
enough quality that we can use them for science applications, 
in particular where we just need redshift and basic classification 
(i.e. deciding between star, QSO, or galaxy). However, a careful 
collation and homogenisation of the existing spectroscopy cata- 
logues is first needed to provide a reliable compendium of these 
data. 

The largest body of spectroscopic redshift information 
comes from the SDSS survey (York et al. 2000; Gunn et al. 2006; 
Smee et al. 2013; Abdurro' uf et al. 2021), totalling more than 
68k spectra of 61k science targets within the outer bounds of 
the eFEDS field. We have collected archival public data from 
SDSS phases I-IV (Ahumada et al. 2020), as well as the re- 
sults of the recent dedicated SPIDERS (Spectroscopic identifi- 
cations of eROSITA sources) campaign (Comparat et al. 2020, 
Merloni et al, in prep), within SDSS-IV (Blanton et al. 2017) 
following-up eFEDS X-ray sources. A small team formed from 
among the authors have visually inspected all of the SDSS 1D 
spectra lying in the vicinity of eFEDS X-ray sources, correct- 
ing occasional pipeline failures, and grading the spec-z onto a 
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common normalised quality (NORMQ) scale between 3 and -1. 
One can interpret NORMQ as follows: spec-z having NORMQ=3 are 
those with ‘secure’ spectroscopic redshifts, those with NORMQ=2 
are ‘not secure’ (although a large fraction are expected to be at 
the correct redshift), spec-z with NORMQ=1 are ‘bad’ (e.g. low 
SNR, problematic extraction, dropped fibres), and those with 
NORMQ--1 are 'blazar candidates’. For completeness, we also re- 
tain SDSS-DR16 spectroscopic redshifts in the eFEDS field that 
do not lie near eFEDS X-ray detections, but only when they sat- 
isfy all the following criteria: SN_MEDIAN_ALL>2.0, ZWARNING 
= 0, SPECPRIMARY = 1, 0 < Z_ERR < 0.002. An exhaustive de- 
scription of the SDSS dataset within the eFEDS field will be 
presented separately by Merloni et al. (in prep). 

We also gathered published spectroscopic redshifts and clas- 
sifications (hereafter ‘spec-z’) from the literature where they 
overlap with the eFEDS footprint, with the detailed breakdown 
presented in Table 2. In order to gather spec-z from smaller sur- 
veys that might only contribute a few redshifts each, we have 
also queried the Simbad database (as of 05/03/2021, Wenger 
et al. 2000), in the vicinity of the eFEDS X-ray counterpart po- 
sitions. 

For the purposes of this work, we place greater weight on pu- 
rity rather than completeness. Therefore, where the parent survey 
catalogues include some metric of quality/reliability, we have ap- 
plied strict criteria to retain only the most secure spec-z informa- 
tion. The filtering criteria applied to the original catalogues, and 
the number of spec-z considered from each catalogue listed in 
Table 2. We assume that after these quality filtering steps, all the 
archival spec-z are ‘secure’ (i.e. NORMQ=3), except Simbad for 
which we adopt NORMQ=2, meant in this case to be interpreted as 
‘not yet proven to be secure’. 
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Spectroscopic Survey Quality threshold Nspecz Data Release & Reference 
SDSS see text 46837 up to DR17; Merloni et al., in prep 
GAMA NQ24 26318 DR3; Baldry et al. (2018) 
WiggleZ Q>4 13466 Final DR; Drinkwater et al. (2018) 
2SLAQ q_z2S=1 953 v1.2; Croom et al. (2009) 
6dFGS 4<qcz<6 365 Final DR; Jones et al. (2009) 
2MRS non-null velocity 152 v2.4; Huchra et al. (2012) 
LAMOST snrr > 10, z > -1, 0.0 < z_err «0.002 55866 DR5v3; Luo et al. (2015) 
Gaia RVS non-null velocity 15568 | DR2; Gaia Collaboration et al. (2018) 
Simbad non-null redshift 3915 as of 05/03/2021; Wenger et al. (2000) 
Total unique objects 143637 


Table 2: Spectroscopic redshift measurements considered 
within the eFEDS footprint (126c«RA «146.2 deg, -3.2<Dec<+6.2 deg). Nspecz is the number of spectroscopic redshift 
measurements that pass the quality threshold (applied to columns provided in the originating catalogue). For Simbad, the number 
of entries is limited to objects lying within 3 arcsec of the optical coordinates of counterparts to eFEDS sources. Note that some 
astrophysical objects appear in two or more redshift catalogues. 


All these spec-z were progressively collated into a single cat- 
alogue, with a single redshift and classification per sky position, 
using a match in coordinates between the coordinates listed for 
the nine input spectroscopic catalogs. An appropriate search ra- 
dius (in the range 1-3arcsec) was chosen according to the ex- 
pected positional fidelity and/or fibre sizes associated with each 
input spectroscopic catalogue. After this de-duplication step we 
are left with 143637 unique entries over the eFEDS field, of 
which 108834 are ‘secure’ (i.e NORMQ=3). 


3. Counterparts identification: Methodology 


Because of the large PSF of the eROSITA telescopes and the 
small number of photons associated with typical X-ray detec- 
tions, the 1o rms positional uncertainties of individual X-ray 
sources can be several arcseconds. Specifically, in eFEDS the 
mean positional error is 4/7 and extends above 20 arcsec only 
for a handful of sources (Brunner et al. 2021). For the expected 
optical/infrared magnitude distribution of X-ray sources at the 
depth of the eFEDS (see e.g., Merloni et al. 2012; Menzel et al. 
2016), the sky density of the relevant astrophysical source popu- 
lations is often large. For this reason, the identification of the true 
associations cannot be determined solely by closest neighbour 
searches, as there will be several potential counterparts within 
the error circle of any given X-ray source. Taking this into ac- 
count, the identification of the counterparts of eFEDS point-like 
sources has been performed using two independent methods. 
Nway (Salvato et al. 2019), based on Bayesian statistics, and 
ASTROMATCH (Ruiz et al. 2018) based on the Maximum Like- 
lihood Ratio (MLR; Sutherland & Saunders 1992), have been 
specifically developed to identify the correct counterparts to X- 
ray sources, independently of their Galactic or extragalactic na- 
ture. In order to assess the probability (or likelihood) of an object 
to be the correct counterpart to an eFEDS sources, the two meth- 
ods take first into account the separation between the sources, 
their positional accuracy and the number density of the sources 
in the ancillary data. The difference between the methods resides 
then in the adoption of specific features able to distinguish an X- 
ray emitter (regardless its Galactic or extragalactic nature) from 
a random source in the field. Both methods determined the fea- 
tures (priors) using a representative training sample constructed 
using secure counterparts to X-ray sources detected in 3XMM. 
In the case of Nway the prior is determined also by comparing the 


features of the sources in the training sample with the features 
of the field sources present within 30arcsec from the 3XMM 
sources. The disentangling power of the priors was then tested 
on a blind validation sample of 3415 Chandra sources with se- 
cure counterparts, where the accuracy of the Chandra position 
was made "eFEDS-like" (see Subsection 4.1). The detailed de- 
scription of the construction of training, validation and respec- 
tive fields sample is presented in Appendix A. Here we provide 
a short description of nway and AsrRoMarcH and how their re- 
spective priors have been determined. 


3.1. nway enhanced with photometric priors defined via 
machine-learning 


In addition to astrometry, i.e. the separation between an X-ray 
source and a candidate counterpart, the associated positional un- 
certainties and the number densities of the sources in the two 
catalogs, the photometry of potential counterparts is valuable in- 
formation to determine whether or not they are associated to a 
given X-ray detection. Traditionally, the likelihood ratio associ- 
ated to angular distance was multiplied by a factor accounting for 
the magnitude distributions and the sky density of a population 
of X-ray sources and background objects (e.g Brusa et al. 2005, 
2007; Luo et al. 2010). In nway, this idea was re-formulated in 
the Bayesian formalism, in the following way. 

Given some data D, the posterior association probability 
P(H | D) is related to the prior probability P(H) via the like- 
lihood P(D | H), P(H | D) « P(H) x P(D | H). P(H) is com- 
puted from the source densities in each catalogue. If photometric 
information (or any other feature, in fact) is used, then the like- 
lihood becomes: P(D | H) = P(D, | H) x P(D, | H) where 
D, and D,, refer to the astrometric and photometric informa- 
tion, respectively. For any possible association, the modifying 
factor P(D,, | H) is computed from the feature (e.g., magnitude 
or colour) m of the counterpart candidate and from the expected 
distribution of this observable for X-ray sources and field (non 
X-ray) sources. We call such factors "priors" to Nway, as they 
enter as a priori information in the ultimate matching process. 
These priors are posteriors previously learned from other data. 
For further details on the formalism we refer to Salvato et al. 
(2019) and the nway documentation®. 


6 https: //github.com/JohannesBuchner/NWAY 
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In order to take full advantage of the LS8 ancillary catalogue, 
we have extended this approach for the eFEDS counterpart iden- 
tification. Instead of using a subset of magnitudes, colours and 
their associated distributions, we have trained a Random Forest 
classifier (sklearn implementation, Pedregosa et al. 2011) on 
a large number of features to reliably map the available Legacy 
DR8 information to real X-ray sources and real field objects the 
details of which are described in next subsection. The trained 
classifier is then used to predict the probability of all counter- 
part candidates to be X-ray emitting taking into account also the 
spatial information, as described in subsection 3.1.2. This prob- 
ability is directly used to compute P(D,, | H). In the following 
section, we describe the definition of the features in the training 
sample. 


3.1.1. Random Forest prior: training and performance 


From the 3XMM training sample described in Appendix A, we 
have extracted a set of photometric and astrometric features. 
The training features are listed and described in Table 3. X-ray 
sources are flagged as target class "1", field objects as target class 
"0". About 15% of the 3XMM training samples (61821 sources) 
are randomly extracted for testing purposes and not further con- 
sidered in the training procedure. The baseline model is com- 
posed of 200 trees, allowing decision split points if at least 8 
samples are left in each branch. All of the 22 features can be 
used for the decision-tree building, which makes use of boot- 
strap samples of the training set. 

By construction the training sample is highly imbalanced, 
since the field objects strongly outnumber the X-ray sources. We 
therefore opted for a weighting scheme, automatically adjusting 
weights of training examples for the class imbalance. 

The trained model is evaluated on the test set, resulting in the 
confusion matrix presented in Figure 3. We note that the cut in 
the class prediction for the presented confusion matrix is made 
at Px-ray = 0.50, where px-ray is the predicted probability that 
a counterpart candidate is X-ray emitting. Since Nwav uses the 
continuous predicted probability as modifying factor for the like- 
lihood P(D | H), real counterparts with rare or untypical pho- 
tometric features, i.e with px ;ay S 0.50, may still be selected 
by the algorithm if the astrometric configuration favours them. 
We obtain a high recall fraction of 2585/(2585 + 457) = 85%, 
while the fractional leakage of contaminating field objects re- 
mains low: 738/(738 + 58041) = 1926. We note that px 4 is only 
computed from the photometric and proper-motion properties of 
the LS8 sources. In particular it does not depend on coordinates 
and positional uncertainties. As discussed in the previous sec- 
tion, this allows us to split the likelihood of a match into inde- 
pendent astrometric and photometric terms: P(D | H) = P(D, | 
H) x P(D,, | H). P(D,, | H), the photometric term, is directly 
related to px jay. 


Feature Description 


flux */mw. transmission, * 
gaia phot * mean mag 


deredenned flux in g,zz W1,W2 
original GAIA phot. in G, Gpp, Grp 


snr * S/N for g,rz W1, W2, G, Gop, Grp 
Jpmra? + pmdec? Gaia proper motion 
parallax Gaia paralllax 


2-5 r-z, z-W1, r-W2 dereddened colours 


Table 3: Final list of LS8 training features used to model the 
photometric prior in NWAY. 
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Fig. 3: Confusion matrix resulting from the random forest pre- 
diction on an independent test set. X-ray sources are labelled as 
"Real X-ray" while field objects as "Field". Numbers on the right 
downward diagonal correspond to correctly predicted classes. At 
this step the separation between the sources and the X-ray posi- 
tion are not considered. 


3.1.2. nway association run 


Using the trained model, we predict px-ray for all LS8 sources 
in the eFEDS field. We then run the nway matching procedure 
using the ratio px ;ay/(1 — Px-ray) for P(Dm | H). This is done 
by adding px-ray as a column to the LS8 catalog and activat- 
ing it as a prior column in Nwav with the -mag option. We set 
a radius of 30 arcsec from each eFEDS X-ray source, consider- 
ing all LS8 sources within this radius. This may appear to be a 
relatively large maximal separation, given the mean eFEDS po- 
sitional error of 4.5 arcsec ; however, we want to account also 
for the largest positional uncertainties of a few objects in the 
eFEDS source catalogue (see Brunner et al. 2021) and the use 
of a large search radius minimises the probability of missing 
counterparts that are widely separated from the X-ray centroid 
position. The sky coverage of eFEDS and LS8 are respectively 
140 deg? and Nerens x x x (307? — Aoverlap Where Nerens is the 
number of eFEDS sources (point-like or extended) and Aoyertap 
the overlap area of neighbouring search windows around the 
X-ray sources. As described in the Appendix of Salvato et al. 
(2019), the area coverages are used for computing the number 
densities, which in turn are used in the computation of the prob- 
ability for an eFEDS source to have a counterpart (p any) and 
the probability for each source in LS8 to be the right counterpart 
(p. i). These are the two quantities that are then used for assign- 
ing a counterpart to an eFEDS source: while the LS8 source with 
the highest p i is considered to be the best available counterpart, 
we use the p any value to decide whether the identification of 
the counterpart is reliable (see 4.2 for details). 


3.2. The MLR approach 


The maximum likelihood ratio (MLR) statistic for the correct 
pairing of sources from multiple catalogs was introduced in the 
seminal work of Sutherland & Saunders (1992) and is widely 
used, although mostly pairing only two catalogs. For sources de- 
tected at two different wavebands and separated by angular dis- 
tance r on the plane of the sky, the likelihood ratio provides a 
measure of the probability that the two sources are true counter- 
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parts normalised by the probability that they are random align- 
ments. Quantitatively this is estimated as: 


_ a) fi) 
nmi) ' 


where q(71) is the prior knowledge on the properties of the true 
associations, such as the distribution of their apparent magni- 
tudes at given spectral window, their colours and/or the spatial 
extent of the observed light in a given waveband. The collection 
of all possible source properties for which a prior probability can 
be estimated is represented by the vector m. The quantity n(m) 
is the sky density of all known source populations in the parame- 
ter space of 7i. It measures the expected contamination rate from 
background/foreground sources that are randomly projected on 
the sky within distance r off a given position. The probability 
that the true associations are separated by distance r is measured 
by the quantity f(r). This depends on the positional uncertainties 
of matched catalogues. 

For the MLR applied to the eFEDS X-ray sources a multi- 
dimensional prior is used that combines knowledge of the optical 
and mid-infrared colours/magnitudes of X-ray sources as well as 
their optical extent, i.e. point-like vs extended. 

The version of MLR applied to the eFEDS work is based 
on the AsrRoMArCH/ implementation. This tool has been specif- 
ically designed to deal with the complexity of wide-area sur- 
veys that contain a very large number of sources. The HEALPix 
Multi-Order Coverage map (MOC?) technology is used to de- 
scribe the footprint of a catalogue of astrophysical sources. The 
KD-tree library as implemented in the AsrRoPv package (As- 
tropy Collaboration et al. 2013, 2018) is used to accelerate spa- 
tial searches of potential counterparts within a radius r of a 
given sky position. The core AsrRoMarCH functionality is ex- 
panded to enable the use of multidimensional priors. The ver- 
sion of AsrROMATCH adopted in this work is therefore a fork 
(github.com/ageorgakakis/astromatch) of the main de- 
velopment branch. 

Like for Nwav, the optical counterparts are investigated out 
to a maximum radius of 30 arcsec. We assume that the positional 
uncertainties of the X-ray and optical catalogues follow a nor- 
mal distribution. The quantity f(r) is therefore represented by a 
Gaussian with c parameter estimated as the sum in quadrature 
of the X-ray and optical positional uncertainties. 

The priors are generated using the training sample defined in 
Appendix A.1. The LS8 photometric properties of the sources 
in that sample were explored to identify parameter spaces, in 
which they separate from the general LS8 field population. After 
some experimentation we opted for the following 3 independent 
priors: 


— A space that includes the WISE colour W1 — W2, the WISE 
magnitude W2 and the optical extent of a source. For the 
latter we use the LS8 parameter rvPE, which provides in- 
formation on the optical morphology of sources. In our ap- 
plication we only differentiate between optically unresolved 
(TYPE="PsF") and optically extended (rvPEZ"Psr") popula- 
tions. 

— A space that includes the optical/WISE colour r — W2, the 
optical magnitude g and the optical extent of a source. For the 
latter we use the Legacy-DR8 parameter TYPE as explained 
above. 


LR (1) 


7 https: //github.com/ruizca/astromatch 
8 https://www.ivoa.net/documents/MOC/ 


— The distribution of the Gaia G magnitudes listed in the LS8 
catalogues. This is to identify X-ray sources associated with 
very bright counterparts. 


The distribution of the training sample sources in the pa- 
rameter spaces above is used to define two 3-dimensional and 
one 1-dimensional independent priors. These are provided as in- 
put to the AsTRoMArCH code, together with the distribution of the 
sources in the field population when computing the association 
for eFEDS. For a given eFEDS source all the potential associ- 
ations within the search radius of 30 arcsec are identified. Each 
of them is assigned one LR value for each of the 3 priors using 
Equation 1.The LS8 source with the highest value of LR from 
one of the three priors is considered to be the counterpart. 


4. Comparing nway and AsrRoMarcH on a validation 
sample 


In order to compare completeness and purity of Nwav and AsTRO- 
MATCH, the same setting adopted for identifying the counterparts 
to eFEDS has been used for determining the best counterparts 
to a blind, validation sample of 3415 counterparts to Chandra 
sources (see Appendix A). This validation sample was used as 
a truth table to test the performance of Nwav and ASTROMATCH 
for finding counterparts and to define the p. any and LR. BEST 
thresholds above which a counterpart is considered secure. 


4.1. The eROSITA-like validation sample 


The Chandra sources were assigned eROSITA positional errors 
by randomly sampling from the astrometric uncertainties listed 
in the core eFEDS source catalogue. We account for the flux de- 
pendence of these uncertainties by matching any given Chandra 
source with a certain flux from 0.5-2 keV to only those eFEDS 
sources with similar 0.6-2.3 keV flux within a margin of 0.5 dex. 
The flux transformation between the Chandra and eFEDS spec- 
tral bands is small, e.g. about 2% for a power-law spectral en- 
ergy distribution with I = 1.9, and is ignored. The positional 
uncertainty, o, assigned to each of the Chandra sources can be 
split into a right-ascension and a declination component. It is as- 
sumed that these two uncertainties are equal and therefore RA 
= óDec = o/ N2. Under the assumption that both the óRA and 
óDec are normally distributed, the total radial positional uncer- 
tainty follows the Rayleigh distribution with scale parameter c. 

Instead of directly using the assigned o as the astrometric 
error to be applied to the Chandra positions to make them re- 
semble the eFEDS astrometric accuracy, we prefer to add further 
randomness to the experiment. For each Chandra source the as- 
signed ø is treated as the scale factor of the Rayleigh distribu- 
tion and a deviate is drawn, which represents the positional error. 
This is applied to the sky coordinates of the optical counterpart 
of the Chandra source and the new offset position is taken as 
the centroid of the X-ray source in the case of an eFEDS-like 
observation. 


4.2. Probability thresholds definition 


The identification of the LS8 counterparts to the Chandra 
eFEDS-like sources was performed by Nway and ASTROMATCH 
using the same setup adopted for the real eFEDS observation. 
The resulting catalogue of best counterparts was matched with 
the true associations, providing a direct comparison between the 
methods, and, at the same time providing a measure of the false- 
positive identification rate of the eFEDS counterpart catalogue. 


Article number, page 7 of 32 


A&A proofs: manuscript no. AandA. Salvato 2021 


1.00 


Completeness 


—— Purity 


c 

\O 

o6 
1 


/completeness 
o > 
ND kel 
nmn 


purity 
e 
Nel 
n 


T T 
0.06 0.08 


0.04 
p_any 


0.10 


Completeness 


— Purity 


purity/completeness 
o o o o 
Nel Nel kel O 
N + nN oo 
Li Li 1 Li 


04 06 0:8 L0 
LR BEST 


0.904 T 
0.0 0.2 


Fig. 4: Purity (blue solid line) and completeness (red solid 
line) as a function of p_any for the association of the CSC2 
eROSITA-like validation sample made with nway (top panel) and 
as a function of LR. BEST, for the asrromatcu (bottom panel). 


First, we compared the primary identifications returned by 
Nway and AsTRomatcH to true identifications stored in the valida- 
tion sample. nway and AsrRoMarchH identify correctly 3216/3394 
(95%) and 3024/3394 (89%) of the sources, respectively. Nwav 
has a higher success rate. Additionally, NWAY has a smaller 
fraction of sources with a second possible counterpart (115 
sources against 367). Another way to look at the results is to 
compare purity and completeness for the two methods. At any 
given value of p. any/LR. BEST we define as purity the fraction 
of sources with the correct identification. In addition we define 
as completeness the fraction of sources for which we can assign a 
counterpart (see Figure 4). Both methods have very high purity 
and completeness, with Nway providing a sample that is purer, 
consistent with the fact that very few sources have a second pos- 
sible counterpart, in addition to the correct one. This, combined 
with the success rate, makes Nwav the more robust method for 
determining the counterparts. Its strength comes first of all from 
the capability to account for complicated priors involving multi- 
ple features (essentially resembling an entire SED, together with 
other physical properties), from different catalogs at the same 
time. Furthermore, the Bayesian statistics upon which Nway is 
based, also allows accounting for sources that are lacking one or 
more of the features. 


Similarly to what is traditionally done in Maximum Likeli- 
hood (see e.g., Brusa et al. 2007), the intersection between the 
completeness and purity can be used for defining a threshold 
above/below which the counterparts is considered reliable. This 
corresponds to 0.035 for p. any and 0.45 for LR. BEST. 
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Sample Number | Counterparts 

| "same" "different" 
DET LIKE 0» 6 27369 | 24193 3176 (11.696) 
DET LIKE 0» 8 21410 | 19162 1795 (8.4%) 
DET LIKE 0» 10 17574 | 16435 1136 (6.596) 


Table 4: Comparison of matches between Nway and ASTROMATCH. 
In the last column the fraction of the "different ctps" with respect 
to the whole sample is also reported. 


5. Determination of the counterparts to eFEDS 
sources 


While for the large majority of the cases the two methods select 
the same counterparts, there are cases where they do not agree 
or where they identify multiple likely associations. In the follow- 
ing we describe the procedure that we have adopted for the final 
assignment of the counterparts. 

Then, after the consolidation of the counterpart, we describe 
a further test for consistency that was done by comparing the 
results of the association with an independent method, HamStar 
(Schneider et al., 2021), which is tuned for the identification of 
Galactic coronal X-ray emitters (Section 5.3). Important to note 
is that the same process is then repeated for the 246 sources in 
the eFEDS hard point-source catalog. From now on, all numbers 
and descriptions are given for the main sample, unless specified 
otherwise. 


5.1. Comparison of counterparts from Nway and AsTROMATCH 


For 24193/27369 (88.4%) eFEDS point like sources in the main 
sample, Nway and ASTROMATCH point at the same counterpart and 
disagree for 3176 (11.6%) of the cases. The numbers are quoted 
at this stage regardless on the p. any or LR. BEST thresholds, that 
will instead be used later in order to assign a flag for the quality 
of the proposed counterpart. 

Table 4 summarises the number of eFEDS sources with the 
agreement/disagreement between the two methods as a function 
of detection likelihood of the X-ray source. Sources with low 
detection likelihood values have, on average, larger X-ray po- 
sitional errors and a larger number of spurious sources is ex- 
pected from simulations (Brunner et al. 2021; Liu et al. 2021b). 
Itis therefore not surprising that the largest discrepancies are ob- 
served at the lowest detection likelihoods (Figure 5). In fact, the 
disagreement drops from 11.6% to 6.5% when considering only 
eFEDS sources with DET. LIKE grater than 10, suggesting that 
at low detection likelihood a fraction of eFEDS sources might 
be spurious detections where Nway and ASTROMATCH assign a dif- 
ferent "field" source. The notion that these are "field" sources is 
also supported by the fact that for about 5096 of eFEDS sources 
with DET. LIKE below 10 and with different counterparts, both 
p_any and LR. BEST are below threshold. 

Table 5 summarises the comparison between the two meth- 
ods also taking into account the reliability of the associations. 
In this table we further split the sample with the same counter- 
parts ("same ctps" for brevity) in two subsamples: one for which 
the proposed counterparts are the only associations suggested by 
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Fig. 5: Number of sources as a function of detection likelihood 
for the entire sample of eFEDS Main catalog sources (black his- 
togram) and for the sources with nway and ASTROMATCH indicating 
different counterparts (grey shaded histogram). 


both methods ("single solutions"; 86.3% of the entire sample) 
and one for which, although both methods point to the same as- 
sociations, at least an additional counterpart at lower significance 
exists from at least one methods ("multiple solutions; 2.196 of the 
entire sample). 

The different priors and the different methods used for as- 
signing the counterparts explain the selection of different coun- 
terparts in the "different ctps" sample. AsrROMArTCH uses three 
priors, but they are used each independently and for any given 
eFEDS source the counterpart is assigned by the prior with the 
higher probability. Instead, nway uses all the features at the same 
time and the best counterpart is the one that mimics best the 
training sample in a multidimensional space. We consider this 
second method more reliable and for this reason we decided to 
list always as primary the counterpart suggested by nway, unless 
LR. BEST is above threshold and p. any is not. 

Interestingly, we note that, in the "different ctps" sample, for 
about 25% of the cases the primary counterpart assigned by one 
method is the secondary counterpart assigned by the other. 


5.2. Assigning a quality to the proposed counterparts 


As a consequence of the discussion above, each counterpart in 
the catalog has been flagged as following ([number] refers to the 
number of sources in the category): 


— CTP. quality-4: when nway and AsTROMATCH agree on the 
counterpart and both p. any and LR. BEST are above thresh- 
old [20873 sources; black in Table 5]; 

— CTP_quality=3: when nway and AsTROMATCH agree on the 
counterpart but only one of the methods assigns the counter- 
part with a probability above threshold [1379 sources; blue 
in Table 5]; 

— CTP. quality-2: when there is more than one possible re- 
liable counterpart. This includes a) all the sources in the 
"different ctps" sample with at least one probability above 
threshold and b) the sources in the "same ctps" sample with 
possible secondary solutions [2522 sources in total; cyan in 
Table 5]. This latter case, due to the low spatial resolution 
of eROSITA, implies that both sources are contributing to 
the X-ray flux. A supplementary catalog with the properties 
of the secondary counterparts for these 2522 sources is also 
released (Section 8). 

— CTP_quality=1: when nway and AsTROMATCH agree on the 
counterpart but both p. any and LR. BEST are below thresh- 
old [1370 sources; purple in Table 5]; Note that a probability 
below threshold does not necessarily imply a wrong counter- 
part. It could also indicate that the counterpart is correct but 


"Same ctps", single solutions — 23622 (86.396) 


p-any p-any 

20.035 «0.035 
LR best» 0.45 | 20873 (0.763) | 561 (0.020) 
LR best« 0.45 | 818 (0.030) 1370 (0.050) 


"Same ctps", multiple solutions — 571 (2.1%) 


p-any p-any 

20.035 «0.035 
LR. best» 0.45 505 (0.018) 7 (3x10~*) 
LR best« 0.45 59 (0.002) - 


"Different ctps" — 3176 (11.696) 


p-any p-any 

20.035 «0.035 
LR best» 0.45 | 1243 (0.045) 226 (0.008) 
LR best« 0.45 478 (0.017) 1225 (0.045) 
HamStar! 4 (2x107^) - 


Table 5: Counterparts quality summary. Comparison of matches 
with nway and AsTROMATCH as a function of their respective 
thresholds, split between "same counterparts" (for both cases 
of single and multiple counterparts) and "different counterparts" 
classes; in parenthesis the fractions of the total sample. The 
numbers in each box are colour-coded by their CTP. quality 
value (see text for more details): Thick black = 4; thick blue 
= 3; thick cyan = 2; thick purple = 1 and thick red = 0. ! 
Objects for which HamStar would point to a different coun- 
terpart with p_stellar>0.95 are by definition sources with 
CTP_quality=2 (see text for details). 


its features are not sufficiently mimicking those in the train- 
ing sample. 

— CTP_quality=0: when nway and astromatcu indicate dif- 
ferent counterparts and both p_any and LR_BEST are below 
threshold [1225 sources; red in Table 5]. 


Counterparts with quality 4,3,2 are considered reliable (90.5% of 
the main sample and 93.9% of the hard sample), while sources 
with quality 1 or O are considered unreliable (9.5% of the main 
sample and 6.1% of the hard sample). 


5.3. Comparison with an independent association method 
tuned to stars: HamStar 


The content of the eFEDS point-source catalog has also been 
analysed in order to specifically identify stellar coronal X-ray 
emitters with sufficiently well-defined properties. This method, 
called 'HamStar' in the following, is based on the properties ex- 
pected for this type of star; the details are presented in (Schneider 
et al. 2021). In short, HamStar performs a binary classification 
between stellar coronal emitters and other objects. This classifi- 
cation is based on the concept of eligible stellar counterparts, i.e., 
the match catalog contains only stellar objects that may reason- 
ably be responsible for the X-ray sources. Specifically, the par- 
ent sample that HamStar uses includes only sources from Gaia 
EDR3 that: 


— are brighter than 19th magnitude in G band (implied by the 
stellar saturation limit of Ly/Lyy x 10^? and the depth of 
eFEDS); 
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— have accurate magnitudes in all three Gaia photometric 
bands (to apply colour-dependent corrections); 

— have a parallax value at least three times larger than the par- 
allax error (to select only genuine stars). 


Then, a positional match between sources in eFEDS and the 
eligible stellar candidates is made, considering all sources within 
5o of the positional uncertainty of the eFEDS source as pos- 
sible stellar counterparts. Finally, the matching probabilities of 
all possible counterparts are adjusted based on the value of the 
two dimensional Bayes map at the counterpart?s Bp-Rp colour 
and ratio between X-ray to G-band flux. With the HamStar algo- 
rithm, 2060 eFEDS sources are expected to be stellar (Schneider 
et al. 2021). The vast majority of them have a unique Gaia coun- 
terpart, and only 83 eFEDS sources have two possible eligible 
counterparts. 

Of the 2060 eFEDS sources with a counterpart from Ham- 
Star, 1883 have the counterpart identified in this work that is less 
than 2 arcsec from the counterpart proposed by Hamstar and we 
assume to be the same source. We visually inspected the cutouts 
of the 29 sources for which the separation between the coun- 
terpart proposed by Hamstar and this work is between 2 and 
3 arcsec, and concluded that for 9 sources the counterparts are 
the same but the sources are heavily saturated in LS8 so that the 
coordinates are not sufficiently precise. This corresponds to an 
92% agreement; incidentally, this value corresponds almost ex- 
actly to the expected reliability and completeness of HamStar 
(Schneider et al. 2021). All these sources will be then classified 
as "Secure Galactic" in Section 6. 

HamStar applies well-understood X-ray-to-optical proper- 
ties of stars to a well-defined subsample of Gaia sources. On the 
other hand, the training samples used by Nway and ASTROMATCH 
include various classes of X-ray emitters: stars and compact ob- 
jects, AGN and galaxies, including the bright ones at the centre 
of clusters (BCG). We considered the prior defined by Nway and 
ASTROMATCH More representative of the population of X-ray emit- 
ters at large and decided to keep the counterpart assigned in the 
previous section rather than changing counterparts for the 177 
sources for which the methods point to different counterparts. 
However, we degrade the CTP_quality, because of the pres- 
ence of an alternative solution. Interestingly only 4/211 sources 
were considered secure, with CTP_quality==3, while all other 
counterparts had already a low CTP_quality. 


5.4. Separation and magnitude distribution of the 
counterparts 


For 24427/247714 (98.5%) of the sources having 
CIP. quality22, the separation between the X-ray posi- 
tion and the assigned LS8 counterpart is smaller than 15 arcsec, 
with a mean of 4.3 arcsec. As might be expected, there is a trend 
for larger average X-ray-optical separations at smaller values 
of DET LIKE; lower detection likelihood sources typically 
have larger X-ray positional uncertainty (see Brunner et al. 
2021). The distribution of the observed X-OIR separations 
normalised by the X-ray positional uncertainty is shown in 
Figure 6 as function of the r magnitude of the counterpart. 
The distribution is broadly comparable to the expectation of a 
Rayleigh distribution with scale factor — 1. 

In Figure 7 we show the distribution of the sample in 
X-ray flux versus optical magnitude space, with the sample 
subdivided into those objects with more secure counterparts 
(CTP. quality22), and those with less reliable counterparts 
(CTP_quality<1). The less reliable counterparts tend to have 
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Fig. 6: Separation between X-ray position and the selected 
counterpart normalised by the 1-dimensional positional error 
of the X-ray source, as a function of its r band magnitude for 
the sources with secure counterparts (CTP. quality22). The 
hexagons are colour coded linearly according to the counts in the 
specific bins. The marginal histograms have linear y-axis. The 
90-th percentile of the r-band magnitude distribution (22.88) and 
the median of the normalised X-OIR separation (1.22) are also 
reported in the marginal 1D histograms. The 1c: Rayleigh distri- 
bution expected for the normalised separations is overplotted in 
green. 


fainter optical magnitudes for a given X-ray flux than the more 
secure counterparts. 


6. Source Characterisation and Classification 


After the identification of the counterparts, to understand phys- 
ical processes and populations, the different classes of objects 
need to be separated. The most important separation is be- 
tween extragalactic sources (galaxies, AGN, QSOs) and galac- 
tic sources (stars, compact objects, etc.). In the following, we 
describe how the classification of the sources was done and the 
validation tests performed. 


6.1. Galactic and extragalactic sources 


In order to classify sources in the most reliable way, we have 
used a combination of methods and various information: spec- 
troscopic, parallax measurements from Gaia, colours and mor- 
phology from imaging surveys. None of the methods is infalli- 
ble because they all depend on the quality of the data (e.g., SNR 
for spectra, depth and resolution of images) and because of the 
degeneracy in colour-redshift space for many of the sources. We 
have therefore adopted a multi-step approach: at each step we 
extract from the pool of sources those that can be classified with 
high reliability either as extragalactic or Galactic. Figure 8 pro- 
vides a graphical illustration of the decision tree adopted for the 
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Fig. 7: Magnitude distribution of the counterpart vs the X-ray flux for sources with CTP_quality>= 2 (left) and CTP_quality< 2 


(right). A cut at log ML FLUX s > —20 has been applied. 


The magnitude distribution is clearly different. The green lines represent the median 0.5-2 keV flux (-13.99 and -14.17 
respectively). The red lines mark the 90-th percentile of the r-band magnitude distribution (22.88 and 24.13, respectively). 


classification, together with the number of sources in each the 
classes. The procedure is described below in detail. 

We first apply the classification based on spectroscopy or 
large parallax. These can be considered primary methods as they 
are highly pure, but certainly not complete. The sources thus 
classified are defined "Secure Galactic" or "Secure Extragalac- 
tic". Briefly, we define "Secure Extragalactic" all sources with 
spectroscopic redshift >0.002 and NORMQ=3 (STEP 1 in Figure 8) 
and "Secure Galactic" all sources satisfying at least one of the 
criteria: 1) spectroscopic redshift «0.002 and NORMQ=3; signif- 
icant parallax from Gaia EDR3 (above 30) or agreement with 
Hamstar counterparts (STEP 2). 

Next, from the sources still in the pool, we extracted those 
that appear extended in the optical images. Depending on 
whether or not photometry from HSC is available, a source is 
defined as extended (EXT) if it satisfies: 


Amag = magxron — magpsr > 0.1 


(2) 
simultaneously in g,7i,z from HSC imaging data (e.g., Palanque- 
Delabrouille et al. 2011), or, when no photometry from HSC is 
available (either because the source is outside the field or be- 
cause of saturated photometry): 


LS8_TYPE z PSF. (3) 


The EXT sources were then flagged as "Likely Extragalac- 
tic" (STEP 3). This is considered a secondary classifier, given 
that, for example, in poor seeing conditions point-like sources 
(or stellar binary systems) would also be mis-classified as ex- 
tended (see discussion presented in Hsu et al. 2014). 

The sources classified as "Secure" were then projected in the 
LS8 z-WI vs g-r plane (see inset in the left panel of Figure 9), 
following Ruiz et al. (2018). There, we have empirically defined 
a line separator described as: 


z-W1-08*(g-—r)+1.2 =0, (4) 


which provides a sharp separation between secure Galactic and 
extragalactic sources, with a negligible fraction of secure ex- 
tragalactic sources lying below the separator (left panel of Fig- 
ure 9). Then, for all the sources still in the pool and with available 
photometry from LS8 (STEP 4), we classify the sources below 
the line as "Likely Galactic" (STEP 5). The remaining sources in 
the pool with available LS8 photometry are classified as "Likely 
Galactic/Extragalactic" (STEP 6) depending on whether they fall 
below or above the line in the W/ vs X-ray flux plane(see inset 
in the right panel of Figure 9), as defined in Salvato et al. (2019): 
W1 + 1.625 + log(Fo.5-2key) + 6.101 = 0, (5) 
with W1 in Vega system and X-ray flux in cgs. Originally, a sim- 
ilar line separator was introduced by Maccacaro et al. (1988) us- 
ing X-ray and optical bands and over time tested at different X- 
ray flux depth or at different wavelength (e.g., NIR; see Civano 
et al. 2012). This new line separates "X-ray bright’ AGN from 
"X-ray faint’ stars, and was constructed combining data from the 
deep COSMOS Chandra Legacy survey (Marchesi et al. 2016) 
and ROSAT/2RXS (Boller et al. 2016; Salvato et al. 2019). It can 
be considered a good separator only after the extended, nearby 
extragalactic sources are taken into account (see right panel of 
Figure 9). It has the advantage of generality, as the W/ photome- 
try and the X-ray fluxes are available virtually for all the eFEDS 
sources. Finally, for the sources without complete information 
from LS8, we assume they are extragalactic (STEP 7), unless 
they are below the W1-X line defined in Equation 5 (STEP 8). 

In this manner, a simple but reliable four-way classification 
scheme (Secure/Likely Galactic/Extragalactic) is achieved. The 
final distribution of the four classes of sources in the g-r-z-W] vs 
WI-X planes is shown in Figure 10. The two line separators iden- 
tify four wedges, two of which can be used for defining almost 
100% pure sub-samples of Galactic/extragalactic Xray selected 
sources. The four wedges are: 
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Notes/Definitions: 
[1] "Secure" spec z is defined by NORMQ=3 
[2] Parallax/Parallax error » 5 
Step 0: [3] Extended is defined as (g diff & r diff & i diff & z diff) » 0.1 in HSC bands or 
Assign CTP. quality LS8_TYPE!=PSF when HSC is not available 
= [4] W1-2.699 = -1.625 * log10(F 0.5-2 keV)-8.8 
[5] LS8 z-W1 = 0.8*(LS8 g-LS8 r)-1.2 


Step 2: 
Step 1: Secure spec 2«0.002? [1] 
Has the counterpart OR 
secure [1] Significant Parallax? [2] 2566 
spec z»0.002? OR 
Coincide with HamStar? 


o 19426 "n 
Is the CTP extended? 


5377 


11856 
Step 4: 
Are LS8 grz and W1 
available? 
Below W1-X-ray ratio line? 3 Likely Galactic Likely Galactic 
[4] 
11738 377 


Step 5: 
Below the grzW1 line? 
[5] 
11361 
Below W1-X-ray ratio line? 
[4] 
11331 
9. Likely Extragalactic 


Fig. 8: Decision tree adopted for assigning each eFEDS point source to the Galactic or extragalactic classes. First we classified the 
sources on the basis of the most secure methods (e.g., high confidence redshift) and then proceeded with less reliable methods (e.g., 
based on colours) on the sources remaining in the pool, creating less pure samples. Note that the numbers listed at each step include 
all sources, i.e., including also those with an insecure counterpart (CTP_quality<2) 


— Top left: 724 sources, out of which 637 (87.9%) are Galactic — — Bottom left: 1391 sources, out of which 1373 (98.7%) are 
(463 and 428, respectively, only considering sources with Galactic (1337 and 1319 for CTP_quality>=2) 
reliable counterparts, CTP_quality>=2) 


— Top right: 23874 sources, out of which 23809 (99.7%) are — — Bottom right: 1380 sources, out of which 479 (34.7%) are 
extragalactic (21711 and 21647 for CTP_quality>=2) extragalactic ( 1263 and 379 for CTP_quality>=2). 


Article number, page 12 of 32 


Salvato et al.: Point-like sources in eFEDS 


It is important to keep in mind that the order of the steps 
taken in the decision tree is crucial for limiting the mis- 
classification of the sources as much as possible. For example, 
the use of spectroscopic redshift in the first step allowed the 
identification of the bright and nearby extragalactic sources that 
would have been mis-classified as Galactic, in the W/-X plane. 
Similarly, the adoption of the high parallax from Gaia allowed 
the identification for secure galactic sources that would have 
been mis-classified as extragalactic in the z-W/ vs g-r plane. 

In summary, the eFEDS Main sample comprises 24393 
sources classified as Extragalactic (5377 "secure" and 19016 
"likely") and 2976 classified as Galactic (2566 "secure" and 410 
"likely". All these numbers are reported in Table 6. 


6.2. Validation of classification using external samples 


We have carried out sanity checks of the classification frame- 
work against two external catalogues, whose members are ex- 
pected to be almost completely extragalactic in nature: the Faint 
Images of the Radio Sky radio component catalogue (FIRST, 
White et al. 1997), and the Gaia-unWISE AGN candidate cata- 
logue (GUA, Shu et al. 2019). Whilst the GUA sample is derived 
from similar underlying datasets to those used in our own source 
classification logic, the machine learning methods used by Shu 
et al. (2019) make this at least a semi-independent test sample. 
Simple positional matches were made against our best matching 
optical counterpart positions, with a search radius of 3 arcsec for 
the FIRST radio component catalogue’, and 1 arscec for GUA 
(GUA objects were considered when they had PROB RF > 0.8). 
We examine the rate at which sources we classify as Galactic or 
Extragalactic (both Secure and Likely) are matched to objects in 
these external catalogues (see Table 6). There is a very low rate 
of apparent disparities between our classifications and those that 
may be derived by matches to the external catalogues. For exam- 
ple, only 0.19% of “Secure Galactic’ sources have a radio coun- 
terpart in FIRST, compared to 7.0% of the ‘Secure Extraglactic’ 
sample. Likewise, only 0.19% of ‘Secure Galactic’ sources are 
matched to candidate AGN from Shu et al. (2019), compared to 
54% of the ‘Secure Extragalactic’ sub-sample. 


Ref. Total Extragalactic Galactic 

sample matches | Secure Likely | Likely Secure 
All eFEDS 27369 5377 19016 410 2566 
FIRST 796 376 414 1 5 
GUA 6357 2924 3425 3 5 


Table 6: Comparison of our classification scheme against two 
(semi-)independent reference catalogues: the FIRST radio com- 
ponent catalogue (White et al. 1997), and the Gaia-unWISE 
AGN candidate catalogue (GUA, Shu et al. 2019). The low rate 
at which our classification logic classifies both radio sources and 
AGN candidates as being “Secure Galactic’ or “Likely Galactic’ 
suggests that our classifications are robust. 


6.3. Very nearby galaxies 


Unlike what happens in pencil-beam surveys, within eFEDS 
there are numerous very nearby and thus resolved galaxies. 


? we only considered the radio components and made no attempt to 
handle complex sources appropriately 


Vulic et al. (2021) searched for eFEDS sources within the D25 
ellipse of the sources in the Heraklion Extragalactic CATa- 
logue (HECATE) of nearby galaxies (Kovlakas et al. 2021). 
For the 100 HECATE galaxies with an eFEDS source nearby, 
93/100 are consistent with the counterpart proposed here by the 
combination of Nway ASTROMATCH. For the remaining 7 cases 
(ID SRC 7551, 12847, 2671, 22198, 17437, 29989 ,20952; see 
Figure A.1) the counterparts identified in this work fall within 
the HECATE galaxies but do not coincide with the centre of the 
galaxy but rather with a source that could be either an ULX in the 
galaxy or an extragalactic source in the background. For these 7 
sources, dedicated studies will be needed to identify the exact 
origin of the X-ray emission. 


7T. Photometric Redshifts 


Photo-z of AGN and X-ray selected sources in general have de- 
veloped dramatically in the last 10 years, bringing the redshift 
accuracy and the fraction of outliers (usual quantities measured 
for assessing the quality of the photo-z) comparable to those 
measured for normal galaxies. 

Regardless of whether photo-z are computed via SED fit- 
ting or via Machine Learning, accurate photo-z for AGN are less 
straightforward to be obtained than for non-active galaxies (see 
Salvato et al. 2018, for a review on the topic). The main reason 
is that, for each multi-wavelength data point, the relative con- 
tribution of host and nuclear emission is unknown and redshift- 
dependent, with redshift being the parameter that we are trying 
to determine. To add to the difficulty, one should not forget the 
impact of dust extinction and variability, the latter an intrinsic 
property of AGN. This, especially for wide-area surveys where 
data are taken over many years, can affect noticeably the accu- 
racy of photo-z if not accounted for (e.g., Simm et al. 2015), as 
for example was possible to do in COSMOS (Salvato et al. 2009, 
2011; Marchesi et al. 2016). In eFEDS, we also have to face the 
issue that the photometry is not homogenised, and different sur- 
veys cover different parts of the field at different depth and with 
different ways of computing the photometry (Kron, Petrosian, 
apertures, model etc). In the following we describe the proce- 
dure adopted for computing photo-z using LePHARE (Arnouts 
et al. 1999; IIbert et al. 2006). We then proceed with an estimate 
of the reliability of the photo-z and a comparison with DNNZ, 
an independent computation of photo-z using machine learning 
(Nischizawa et al., in prep). 


7.1. Photo-z Computation 


We computed the photo-z for the sources classified as extra- 
galactic. In order to minimise systematic effects, we have used 
different types of photometry, depending on the survey; in par- 
ticular, we have tried to avoid photometry derived from models 
for the extended and nearby sources. This is because usual mod- 
els are good representation of point-like, disk-like and bulge- 
like sources, but are unable to represent, for example, a lo- 
cal Seyfert galaxy where nuclear and host components would 
be both contributing to the total flux. For this reason, we have 
used total fluxes from GALEX; Kron and cmodel photometry 
from HSC, depending on whether the source is extended or 
not (see below), and GAAP (Gaussian Aperture and Photom- 
etry) from KiDS+VIKING. From VHS we have adopted Pet- 
rosian photometry as it appears to be more in agreement with the 
VISTA/VIKING photometry. All the photometry was corrected 
for Galactic extinction, using E(B-V) from LS8. Depending on 
whether the source is in the area covered by KiDS+VIKING, 
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Fig. 9: The distribution of sources flagged as "Secure Galactic" (red) and "Secure Extragalactic" (blue) in the g-r vs z-W/ (left) and 
W1 vs X-ray (right) planes are used for determining a line separator (black line) to be used for classifying sources in steps 5, 6 and 
8 of the flowchart presented in Figure 8. The line separator on the right has fewer Galactic sources that fall into the extragalactic 
locus. However, the line separator defined on the left has literally only a handful of extragalactic sources falling into the Galactic 
locus, making this classifier more efficient, provided the four photometric points are available. 
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Fig. 10: The four eFEDS X-ray sources classes "Secure Galac- 
tic" (red), "Likely Galactic" (orange), "Secure Extragalactic" 
(blue) and "Likely Extragalactic" (cyan) defined in the flowchart 
presented in Figure 8 distributed according to their distance from 
the two lines defined in Figure 9. Three of the four wedges thus 
defined contain extragalactic or Galactic samples that are up to 
99% pure (see text for details). 


10 Tn particular, note that for HSC, in the S19A release available to us 
at the time of this work, photometry in r2 and i2 filters is provided. 
However the filters have changed during the survey and depending on 
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within HSC but outside KiDS and outside HSC, different bands 
were available? 

The computation of the photo-z followed the procedure al- 
ready outlined in Salvato et al. (2009, 2011); Fotopoulou et al. 
(2012); Hsu et al. (2014); Marchesi et al. (2016); Ananna et al. 
(2017), where sources are treated differently, depending on 
whether the optical images indicate them being extended (EXT) 
or a point-like/unresolved (PLIKE), following Section 6. This 
step is particularly important, as sources in the two samples are 
treated differently, using different priors and templates. 

In addition, the fitting templates are selected on the basis of 
the X-ray depth and coverage of the surveys, keeping in mind 
that e.g., bright AGN will be mostly absent in a deep pencil- 
beam survey, characterised instead by host-galaxy dominated 
sources. Given the similar X-ray depth, the libraries used in 
Ananna et al. (2017) for the Stripe-82X survey have been a good 
starting point for our work on eFEDS. However, recently a new 
library of templates for AGN and hybrids (AGN and host) was 
presented in Brown et al. (2019). The authors used photometry 
and archival spectroscopy of 41 AGN to create an additional set 
of 75 new hybrid templates. With respect to previous AGN tem- 
plates they have the advantage that they are empirical for the en- 
tire wavelength coverage and that the contribution from the host 
and AGN components is fully taken into account when creating 
the final SED, including dust attenuation and emission lines. 

eFEDS is particularly rich in sources with reliable spec- 
troscopy (see Section 2.3, allowing for a better tuning of the 
templates to be used for the photo-z computation. To optimize 
the template choice, the colours of all the sources with reliable 
spectroscopy were plotted as a function of redshift, together with 
the theoretical colours from all the templates available (Figure 11 


the coordinates of the sources the fraction of data obtained with the 
original or the new filters changes. In order to account for this at any 
location, we have adopted the filter that was used for obtaining at least 
50% of the data. This solution is not optimal and will affect the quality 
of the photo-z in some areas. 
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Fig. 11: i-zand W1-W2 colours of eFEDS extragalactic sources as a function of their (reliable) spectroscopic redshift (black points). 
Overplotted are the tracks of theoretical colours as a function of redshift derived from all the templates used in this work for the 
PLIKE (upper panels) and EXT (lower panels) samples, as listed in Appendix B 


illustrates this for i-z and W/-W2 for the EXT and PLIKE samples, 
respectively, for all the templates that ultimately were adopted 
in this work). In selecting the templates we tried to limit their 
number (to control degeneracy in the redshift solution), while at 
the same time compiling a list representative of the entire pop- 
ulation. The sources that in Figure 11 are outside the parame- 
ter space covered by the templates can be interpreted in various 
ways, from problems in the photometry of the specific objects 
due to blending with neighbours or variability or lack of cer- 
tain features in the templates. While we will further investigate 
this latter possibility for the future eROSITA surveys, here we 
need to keep in mind that the figures are representative of only 
two colours, while in selecting the templates we look at all the 
colours that our photometric set allows. 


An important point to keep in mind is the fact that despite be- 
ing rich, the available spectroscopic sample is not representative 
of the entire eFEDS population as can be seen in Figure 12. For 
this reason the final library should also include some templates 
for types of sources that are expected to be present in eFEDS, 


without being necessarily identified yet. In particular we have 
created a set of templates using the archetype of type 1 AGN 
from the counterparts of ROSAT/2RXS (Salvato et al. 2019) ob- 
served within SDSS-IV/SPIDERS presented in (Comparat et al. 
2020), extended in UV and MIR with various slopes. For the 
non empirical templates reddening was also considered, using 
the extinction law of Prevot (Prevot et al. 1984) with E(B-V) 
values from 0 to 0.4 in steps of 0.1. The selected templates are 
presented in Appendix B. 


As output Le PHARE provides the best value for the best 
photo-z together with the upper and lower 1,2,3 o error, the best 
combination of template, extinction law and extinction value, 
the quality of the fit and the pdz, the latter being the red- 
shift probability distribution defined as pdz= f F(z) dz between 
Zpes: + O.1(l + Zbest) With Zpes being the photo-z value corre- 
sponding to the best fit. For photo-z computed with a sufficiently 
high number of photometric bands covering the entire SED (e.g., 
XMM-COSMOS; Salvato et al. 2009) a high value of pdz can 
be safely translated in reliability of the photo-z. As it has been 
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Fig. 12: Magnitude distribution for the entire extragalactic sam- 
ple and the subsample for which reliable spectroscopy is avail- 
able. The vertical lines indicate the mean values of the two sam- 
ples. 


discussed in Brescia et al. (2019), this is not the case when the 
photometric set is not rich, and pdz can be high also for a poor 
fit just because there are no sufficient constraints. 


7.2. Reliability of photo-z 


The final comparison between photo-z and spec-z, consid- 
ering EXT and PLIKE sources together, for the area within 
KiDS+VIKING and within HSC but outside KiDS+VIKING is 
shown in Figure 13. We used the standard metrics for measuring 
the quality of photo-z (see Salvato et al. 2018, for more details on 
definitions): a) the fraction of outliers 77: it highlights the fraction 
of sources with unexpectedly large errors and it is defined as the 
fraction of sources for which |Zphot — Zspecl/(1 + Zspec) > 0.15 (e.g., 
Hildebrandt et al. 2010). b) accuracy oymap that describes the 
expected scatter between predictions and truths and it defined as 
1.48 x median(|Zphot — Zspecl/(1 + Zspec)) (bert et al. 2006). 


The results are listed in Table 7. Figure 14 shows the same 
results, but split as a function of z-band magnitude from LS8, 
X-ray flux and spectroscopic redshift. 


Ideally, for the best computation of photo-z, in particular for 
sources dominated by emission lines as AGN, photometry from 
broad band filters across the entire spectral range should be com- 
plemented by narrow band and near infrared photometry and 
should be homogenised (e.g Salvato et al. 2009, 2018). While 
narrow-band photometry is not available, at least some of the 
surveys do provide homogenised photometry. For what concerns 
NIR photometry, the VISTA/VHS data are not sufficiently deep 
and how this impacts the photo-z is clearly visible in all the pan- 
els of Figure 14, where the fraction of outliers is usually higher 
and the accuracy lower (high value of oNmap in the area with- 
out VIKING coverage (dotted lines). Not only are the NIR data 
shallow outside the KiDS+VIKING area; they are just a collec- 
tion of photometric points computed in different ways, simply 
matched in coordinates. For this reason, based on the footprints 
shown in Figure 1, we can think of the photo-z in eFEDS as 
divided in three regions that reflect the quality of the available 
photometry: the inner area covered by deep forced photome- 
try in KiDS+VIKING; the area that is within HSC but outside 
KiDS+VIKING for which some NIR information is provided by 
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the shallow VISTA/VHS; the area outside HSC for which the 
optical photometry is provided only by LS8. 

The lack of deep NIR data also creates an unusual number 
of sources at high-z (z>3), most of which are most likely incor- 
rect. For example the number of sources with photo-z > 3 is 
188 within KiDS and 819 in the HSC area outside KiDS, despite 
the area being about the same size. Within KiDS+VIKING, LE 
PHARE correctly estimates the redshift for 40 of the 55 (72.7%) 
sources spectroscopically confirmed to be at redshift larger than 
3. Most of these high-z sources in excess can be easily identified 
and flagged by noticing that they are characteri sed by having 
high pdz despite being in the area outside the HSC, i.e., with a 
very limited number of photometric points to be fitted (see Sec- 
tion 7.4). 

Figure 14 also shows how the accuracy degrades and the 
fraction of outliers increases for the PLIKE that are X-ray bright 
(top, second panels from the left). These are sources dominated 
by the AGN component with an SED close to a power-law and 
for which the lack of narrow-band photometry that would iden- 
tify the emission lines, does not allow breaking of the degener- 
acy in the redshift solutions. However, in eFEDS there are only 
47 extragalactic sources with an X-ray flux above 5 x 107? erg 
cm^? s^! and a reliable spectroscopic redshift is available for 39 
of them, so that the low quality of the photo-z for these sources 
has only a limited effect. Finally, Figure 14 shows an undesired 
high fraction of outliers at low redshift, where photo-z values 
for normal galaxies usually are extremely accurate. We believe 
that the problem for AGN originate from the fact that both KiDS 
and HSC photometry are based on fitted models and not to total 
fluxes. Models are not able to account properly for the contribu- 
tion of the nuclear component that is comparable to the one from 
the host. Photometry from models can represent well sources at 
very low redshift, where the AGN contribution is negligible with 
respect to the host, and at high redshift, where the flux is domi- 
nated by the AGN component. 

As already highlighted in the past it is always easier to obtain 
a reliable photo-z for galaxy dominated sources with the charac- 
teristic breaks in the SED. AGN dominated sources do suffer 
from degeneracy in the redshift solution, especially when little 
photometry is available, even within the KiDS+VIKING area 
(compare dashed lines for EXT and PLIKE). 


7.3. Comparison with DNNz 


Within the HSC collaboration, the computation of photo-z is 
available in many flavors. The method that performs better on 
AGN is DNNz (Nishizawa et al., in prep). It is based on machine 
learning and uses exclusively HSC photometry, trained on the 
rich spectroscopic sample available for both AGN and normal 
galaxies within the entire HSC region (beyond the area in com- 
mon with eFEDS). The DNNz is based on the Multi-Layer Per- 
ceptron (MLP) that takes the cmodel flux, PSF matched aperture 
flux, and the second order moment size measured at five HSC fil- 
ter bands as inputs, and takes posterior probability as an output. 
In total, 3 x 5 inputs and output PDF is binned in 100 bins from 
z = 0 toz = 7. We have five hidden layers and each layer has 
100 nodes where all nodes are fully connected to the nodes in the 
neighboring layers. With a 50k spectroscopic sample, it takes al- 
most a single day to train this machine with NVIDIA GeForce 
RTX 2080Ti GPU. 

One interesting feature of DNNz is that it was trained for any 
type of extragalactic source, without any particular tuning for 
AGN. In Table 7, the performances of DNNZz are directly com- 
pared with the output from Le PHARE. Remarkably, the accu- 
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Fig. 13: spec-z vs photo-z from Le PHARE for the sources inside KiDS+VIKING (left panel) and inside HSC but outside 
KiDS+VIKING (right panel). The sources in gray are considered outliers while the red lines correspond to 1) z phot = z spec 
(thick solid), 2) z phot = z spec +0.05 x(1+z_spec ) (solid) and 3) z phot 2z spec + 0.15(1  z spec ) (dotted). 


AREA sample Ls PHARE DNNz 

N spec/Nioi 7] ONMAD Nspec/N tot 7] ONMAD 

inside KiDS PLIKE 2331/6808 17.8% 0.048 2280/6808 21.8% 0.045 
EXT 1141/4598 . 6.7% 0.054 1139/4598 . 6.396 0.032 

TOTAL* | 3472/11406 14.1% 0.049 3419/11406 16.7% 0.039 

outside KiDS PLIKE 1399/8532 28.6% 0.081 1325/8532 34.9% 0.075 
EXT 488/4221 8.6% 0.040 483/4221 4.9% 0.026 

TOTAL* | 1887/12753 23.8% 0.068 1808/12753 27.2% 0.052 

outside HSC PLIKE 15/145 42.9% 0.164 N/A N/A N/A 
EXT 3/89 0.0% 0.039 N/A N/A N/A 

TOTAL 18/234 33.3% 0.122 N/A N/A N/A 


Table 7: Fraction of outliers and accuracy for Le PHARE and DNNz computed using the extragalactic sources with secure spectro- 
scopic redshift, split by area. In each row the difference between the numerators in the Nypec/Nior columns provides the number of 
sources with spectroscopy for which DNNz could not provide a photo-z, mostly because the HSC photometry is saturated for those 


Sources. 


racy of DNNz is in general higher than for LE PHARE, although 
with a higher fraction of outliers. 


Interestingly, despite using only HSC photometry also 
DNNz shows a remarkable difference in the quality of the 
photo-z for the sources within or outside the area covered by 
KiDS+VIKING. This is probably due to the combined photom- 
etry from the filters r and r2 and i and i2 that were changed 
during the survey. Most of the KiDS+VIKING area has been 
homogeneously observed only in i- and r- band, while the rest 
of the area has a mixture of observations. Taking this into ac- 
count, we can compare Le PHARE and DNNz in the area within 
KiDS+VIKING and split by TYPE. Figure 15 shows how both 
sets of photo-z suffer from some systematics (vertical and hori- 
zontal substructures) due on one side to the imbalance between 
galaxies and AGN in the training of DNNz and on the other hand 
to the degeneracies in the solution for power-law dominated 
AGN and limited availability in photometry for LE PHARE. 


However, when the photometry is sufficient and of good qual- 
ity, SED fitting can predict correctly the redshift of AGN also 
when higher than 3 (middle panel of Figure 15; sources in red). 
This is a current limitation for photo-z computed via machine 
learning given the small sample of this kind of sources available 
for training (see Nishizawa et al., in prep.). 


When the photo-z from DNNz is available, for each source 
we can measure the mean photo-z between the values pro- 
posed by the two methods. Assuming this value as the right 
one, we have that for 60.696 of the extragalactic sources with 
CTP_quality>2 DNNz and Le PHARE agree (|ZPLePHARE — 
ZPDNNzl < 0.15 x (1 + mean(ZpLePHARE; ZPDNNz)). The compari- 
son between Ls PHARE and the spectroscopic redshift for 3919 
sources with spec-z is shown in the third panel of Figure 15; the 
fraction of outliers with respect to the spectroscopic sample is 
extremely small and the accuracy very high, comparable to the 
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Fig. 14: Fraction of outliers (top) and accuracy (bottom) as a function of magnitude (z from LS8), 0.2-2.3 keV X-ray flux and 
spectroscopic redshift split for type (EXT/PLIKE) and in area (with/without VIKING coverage). The x-axes are binned in equal 
numbers of elements taking the quantiles between [0,1] in steps of 0.1. In other words, all lines also account for the size of each 


spectroscopic sub-sample. 


accuracy routinely obtained for normal galaxies, using purely 
broad band photometry. 


Table 8 summarizes the result for DNNz and Le PHARE sep- 
arately, within and outside KiDS+VIKING. For the about 7500 
sources for which the two methods provide results in disagree- 
ment, the spectroscopic sample does not help in discriminating 
the best photo-z, given that the spectroscopic sample is very 
small (756 and 553 sources in the two areas, respectively) and 
not representative of the magnitude distribution in the sample 
(mean r value of the spectroscopic sample 20; mean r value of 
the sample for which DNNz and Le PHARE disagree 21.5. See 
also next section and Figure 16). Photo-z derived via machine 
learning are well known to be very reliable only within the pa- 
rameter space represented by the training sample, having little 
predictive power outside (e.g., Brescia et al. 2019). Keeping this 
in mind, we decided to rely on the prediction power of SED fit- 
ting and rely on the results from Le PHARE. However, we re- 
port also the results from DNNz and flag the sources for which 
Ls PHARE and DNNz agree, or disagree, respectively (see next 
Section 7.4). 
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7.4. CTP. REDSHIFT and CTP. REDSHIFT. GRADE in the final 
catalog 


In the final catalog we report the spectroscopic redshifts (regard- 
less of their reliability) and the photo-z from both Le PHARE 
and DNNz. In addition, for each source we summarise in the two 
columns CTP. REDSHIFT and CTP. REDSHIFT. GRADE our best 
knowledge of redshift and its reliability. 

The column CTP REDSHIFT lists original spectroscopic 
redshift when it is available and reliable (NORMQ=3). The redshift 
is set to O for all the sources that are classified as GALACTIC 
(either SECURE or LIKELY) or for which reliable redshift is 
not available. To the remaining sources we assign the photo-z 
from Le PHARE. 


Then in the column CTP. REDSHIFT GRADE we provide a 
grade of confidence to the redshifts and the grades are: 


— CTP_REDSHIFT_GRADE=5 : This is the higher grade, as- 
signed to the sources with reliable spectroscopic redshift. 
Of the 6591 sources in this category 5377 are extra- 
galactic sources and 1214 are Galactic (6465/6591 with 
CTP_quality > 2). 
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Fig. 15: Direct comparison between photo-z computed in this work with Le PHARE and DNNz, within the HSC area for all the EXT 
(left panel) and PLIKE (central panel) sources. By construction true EXT sources should not have spectroscopic redshift exceeding 
z = 1. It is not possible to decide a a priori whether the photo-z are wrong, or whether the sources have been erroneously put in 
the EXT sample due to some issue of the photometry. In the middle panel in red we highlight the sources that have spectroscopic 
redshift higher than 3 (see text for details). Right panel: Comparison between photo-z from Le PHARE and spec-z for the sources 
with Le PHARE and DNNZz in agreement 


OUTLIERS FRACTION 
Le PHARE in KiDS outside KiDS 
Le PHARE and DNNz agree 3.7% [99/2663] 8.5% [107/1256] 
Le PHARE and DNNz disagree 49.9% [377/756] 51.3% [284/553] 
DNNz in KiDS outside KiDS 


Le PHARE and DNNz agree 
Le PHARE and DNNz disagree 


4.0% [106/2663] 
56.1% [424/756] 


6.8% [85/1256] 
64.0% [354/553] 


Table 8: Fraction of outliers for the photo-z computed with Le PHARE (top) and DNNz (bottom), split by area, with respect to 
the number of sources with spectroscopic redshift for which the two methods agree (first row) or disagree (second row) following 
the definition in Section 7.3. The table clearly indicates that when Le PHARE and DNNz disagree, DNNZ has a higher fraction of 
outliers among the spectroscopic sample, while when the two codes agree, the difference in fraction of outliers is marginal.Note 
that in the definition of agreement we used 1+mean(LEPHARE,DNNZ). The small difference in the fraction of outliers for the two 
methods when they agree, depends on how close they are to the real spectroscopic value 


— CTP. REDSHIFT. GRADE-4 : This is assigned to the sources 
for which the photo-z from Le PHARE and DNNZz agree 


in NIR: in fact the large majority if these high-z sources are 
concentrated in the area outside KiDS. 


(10949 in total, 9643 of which have CTP. quality > 2), 
because in the previous section we have demonstrated 
that for this subsample the fraction of outliers is very 
small and the accuracy very high. By construction, all 
the Galactic sources without spectroscopic redshift have 
CTP. REDSHIFT. GRADE-4 because DNNz and Le PHARE 
are set to zero and belong to this subsample (2995 sources). 


CTP. REDSHIFT. GRADE-3 : This is assigned to the sources 
with Le PHARE and DNNz in disagreement and pdz > 40 
(6741 in total, 6057 among the sources with CTP. quality 
> 2). The threshold at pdz > 40 has been set by looking at 
the fraction of outliers as a function of pdz in the sample 
with spectroscopic redshift (see Table 9). At the same time 
we searched for the value of pdz that minimised the num- 
ber of outliers and maximised the number of sources with 
z_phot>4. The latter is suspiciously too high and this is due 
to the lack of deep photometry not only in the UV but also 


CTP_REDSHIFT_GRADE=2 : This is assigned to the remain- 
ing sources with Le PHARE and DNNz in disagreement and 
pdz < 40, for which we are less confident about the photo- 
metric redshifts, In this group there are only 1326 sources, 
with 1092 having CTP_quality > 2. 


Figure 16 shows the distribution of the sources for each of 


the REDSHIFT_GRADE in the magnitude redshift plane. Indicated 
are also the mean value of the redshift and magnitude for each 
of the subsamples. 


7.5. Flagging sources likely associated with clusters of 


galaxies in the point-like sample 


By construction the eFEDS X-ray point-source catalog is ex- 
pected to have very little contamination from clusters of galax- 


les; 


still, a small probability that a source is actually a clus- 


ter remains, as shown from the simulations we performed for 
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pdz N. N. N. N. 

threshold | sources sources sources sources 
w/spec-z | w/spec-z  )w/z phot >4 
& outliers 

« 20 683 60/5287 46/60 103/386 

« 30 1058 93/5287 67/93 199/386 

< 40 1429 134/5287 94/134 230/386 

< 50 1915 197/5287 121/197 252/386 


Table 9: Properties distribution for secure counterparts classified 
as extragalactic sources with pdz lower than a certain threshold. 
The lower the pdz, the lower the quality of the fitting. 


eFEDS (Liu et al. 2021b). The reasons for clusters ending up 
in the point-like sample are manifold (Willis et al. 2021). Most 
obviously, clusters with small apparent size or at low detection 
likelihood can fall below the thresholds used to define the exten- 
sion of the X-ray source. In addition, clusters could leak into the 
point-like sample because of source splitting and superimposi- 
tion of a bright point source and a cluster. 

With this in mind we have run the multi-component matched 
filter cluster confirmation tool (MCMF; Klein et al. 2018, 2019) 
on the eFEDS point source catalog. We run MCMF as in the 
eFEDS extended sources catalog (Klein et al. 2021), after ad- 
justing some of the parameters (for example, limiting the area 
search from the X-ray position). As in Klein et al. (2021) we 
have defined a "contamination fraction", feont, which expresses 
the probability for an optical concentration of red galaxies to be 
a chance alignment along the line of sight to the X-ray source. 
This is the key selection criterion for selecting cluster candidates 
and immediately provides an estimate of the catalog contamina- 
tion. A catalog created by selecting fcont < a is expected to have 
contamination fraction of a, assuming the input catalog is highly 
contaminated. 

Because of the high number density of sources in the point- 
like sample and/or the possibility that the emission from an ac- 
tual cluster is split in many point sources, it can happen that 
many close X-ray sources point to the same optical cluster. A 
simple cut in front Will therefore yield a much larger sample of 
sources than real clusters in that catalog, causing the contami- 
nation fraction to be much higher than expected. To compensate 
for that, for each eFEDS point source that is close to an optical 
overdensity, an "environmental flag" is set to true for the source 
that is closest to the overdensity and that is at least 0.75 Mpc 
away from a cluster detected in the extent selected sample (Liu 
et al. 2021b; Klein et al. 2021) at similar redshift. Only when 
the flag is set to true is the point-source further considered as a 
candidate for being a cluster. 

For these latter subgroup of sources, following Klein et al. 
(2021), MCMF assigns a redshift to the cluster (via red- 
sequence). In addition, the photo-z of the counterpart to the 
point-like sources is recomputed assuming they are passive 
galaxies. The two redshifts are then compared with the one com- 
puted by LePuare as described in the previous section (Sec- 
tion 7). 

Combining all the information described above we define a 
new flag, Cluster. Class, which indicates the possibility that 
an eFEDS X-ray (point-like) source is actually a cluster or be- 
longs to a cluster. 


— Cluster class-5: CTP_QUALITY< 1 & fiy < 0.2 and 
the environmental flag set to true: the counterpart Nwav or 
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Fig. 16: eFEDS sources with reliable counterpart distributed 
in the magnitude vs. redshift plane, split among the various 
CTP_REDSHIFT_GRADE classes. For each panel also the mean 
value for redshift (vertical line) and magnitude (horizontal line) 
is indicated and the entire population is shown in light grey. 


ASTROMATCH is considered unreliable and the X-ray emission 
is more likely associated to a cluster (top-left panel of Fig- 
ure 17; 120 cases). 


Salvato et al.: Point-like sources in eFEDS 


Fig. 17: from top-left,clock-wise:four examples of Cluster_Class=5,4,3,2, as described in Section 7.5. X-ray contours are plotted 
in white, with the magenta cross indicating the X-ray position and magenta square indicating the counterpart selected in this paper, 
magenta contours indicate the red sequence galaxy density. The HSC g, ri cutouts are 5.5'x5.5'in size. 


— Cluster. class-4: CTP_QUALITY> 2 & feont < 0.2 with and the source is a cluster member (bottom-left panel from 
the environmental flag set to true, the optical colours of the the left of Figure 17; 96 cases). 
sources are typical of passive galaxies and the redshift com- — — Cluster class-2: CTP_QUALITY> 2 & fcon < 0.01 and 
puted with LePuare coincides with the redshift of the optical the environmental flag set to true while the photo-z com- 
cluster: the counterpart is reliable and the point source is a puted by the three methods are in disagreement: the counter- 
galaxy member (possibly the BCG) of the optically detected part is reliable and the source (AGN) is just projected on a 
cluster (to-right panel in Figure 17; 63 cases). likely cluster (bottom-right panel of Figure 17; 67 cases). 


In all the other cases the X-ray emission is from a genuine 
Cluster class-3: CTP_QUALITY> 2 & feont < 0.2 and point-source and the and it likely not from the extended, hot in- 
the environmental flag set to true and the redshift computed — tercluster medium. A dedicated effort is currently ongoing to 
assuming an AGN template is consistent with the redshift of confirm the secure clusters in the point source catalog as well 
the optical cluster but the optical colours of the counterpart as characterising and measuring their X-ray and radio properties 
are not typical of a passive galaxy: the counterpart is correct of the confirmed clusters (e.g., Bulbul et al. 2021). 
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Fig. 18: The distribution of Galactic and extragalactic sources in eFEDS, colour coded by redshift in the parameter spaces defined 
by typical colours and fluxes. For the plots with optical and MIR colours, the track of templates characteristic of the population, also 
used in the computation of the photo-z are overplotted. The legenda on the top-left panel provides all the details for the four panels. 


ner et al. 2021) associated with this paper are available via 


8. Data Release 
CDS/Vizier and via the web page at MPE dedicated to the 


The catalogs listing the properties of the counterparts to eFEDS 
point like sources in the Main and Hard selected samples (Brun- 
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Fig. 19: Left: Classes of X-ray emitting sources are shown in colour-colour space. The clouds for QSOs, elliptical galaxies and 
Seyfert 2 galaxies are derived from theoretical colour-redshift tracks computed with LePhare and a subset of SED templates used 
for the determination of photo-z in this work. Seyfert 2 and elliptical tracks were limited to z<1, while the QSO cloud is divided in 
z<2.5 and 2.5«z«4. The black arrow indicates the evolution of a theoretical QSO at z=0.2 in colour-colour space with increasing 
extinction. The ticks along this arrow are steps of Aga. y, = 0.1. The stellar cloud was derived from a 2D-histogram of MIST/MESA 
isochrones assuming [Fe/H]=-2 (Choi et al. 2016; Dotter 2016; Paxton et al. 2018). Right: The same classes are shown on top of 
the binned eFEDS data (CTP. quality > 2), colour-coded according to the average hardness-ratio (see text for details). 


eROSITA data release!!. The list of the columns and their de- 
scription for the two samples is available in Appendix D. Only 
the basic X-ray properties are listed here (columns 1-9). For the 
complete list please refer to the catalogs released in Brunner 
et al. (2021). After the columns reporting the key X-ray proper- 
ties of the sources, columns 10-36 report the results of the coun- 
terpart (CTP) association followed by the key parameters from 
NWAY, ASTROMATCH and Hamstar respectively. Next (columns 36- 
49) we present the photometry from the recent Gaia EDR3 re- 
lease, in the original photometric system, followed by all the 
collected photometry, corrected for extinction (columns 51-108). 
To be remembered is the fact that the HSC photometry from 
S19A in i- and r- bands has been split in i-, i2- and r-, r2- 
and that Kron is listed for EXT sources while cmodel is listed 
for PLIKE (see Section 7). Columns 109-117 list basic prop- 
erties of the sources, like whether they are within KiDS, HSC 
etc., while columns 118-126 deal with all the information on the 
spectroscopy available information related to spectroscopy when 
available. The output parameters from Le PHARE and DNNZ 
are listed in columns 127-148. The columns CTP. REDSHIFT 
and CTP. REDSHIFT. GRADE summarise the redshift properties 
of the sources as discussed in Section 4.2, while the column 
CLUSTER CLASS refers to the results presented in Section 7.5. 


In addition to the catalogs, we can provide under direct re- 
quest to the first author, the redshift distribution function and 
SED fitting of each source in the catalog. An example is shown 
in Figure C.1. 


!! https://erosita.mpe.mpg.de/edr/eROSITAObservations/ 
Catalogues/ 


9. Discussion 


The size and depth of the eFEDS X-ray survey, combined with 
ancillary data both in photometry and spectroscopy, allows us to 
paint a comprehensive picture of the average population of X-ray 
sources that contribute the bulk of the cosmic X-ray background 
(CXB) flux at energies « 10 keV (see e.g., Gilli et al. 2007), 
both in its Galactic and extragalactic content. The identification 
of the optical/IR counterparts, to a high degree of completeness 
and reliability, as discussed here, will facilitate detailed popula- 
tion studies of X-ray active stars, Galactic compact objects, and 
AGN. Here we briefly outline the main properties of our sample 
by examining in detail the distributions of the X-ray sources in 
various colour/redshift spaces. 


9.1. Population studies 


Figure 18 shows the distribution of all the eFEDS sources with 
a secure counterpart (CTP. quality22; see section 5) in four 
different multi-band photometric spaces, chosen for their wide 
applicability to large areas of the sky. 

The top panel shows sources in the z-W/ vs. g-r space, 
colour-coded by their redshift. Overlaid are a few representa- 
tive tracks of various classes of extra-galactic objects. Beside 
the clear separation between Galactic and extra-galactic objects 
already discussed in section 6, the X-ray points identify clear 
sequences of un-obscured QSO, obscured Seyferts and inac- 
tive galaxies. The inactive galaxies are best represented by the 
SO and Elliptical tracks, suggesting that some of these are the 
sources that are associated (or confused) with a cluster. Indeed 
the sources indicated by a yellow circle in the top left figure, have 
CLUSTER CLASS-344 indicating that they belong to a cluster and 
in most cases they are the BCG (see Section 7.5). These sources 
are best fit by the template of a passive galaxy as the spectra for 
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those available also suggest ( e.g., lack of emission lines from 
star formation, strong HK lines). However, some of the spectra 
together with the clear features from a non star-forming galaxy 
also reveal the presence of broad emission lines typical of AGN 
(see Bulbul et al., in preparation). 

The top-right panel of Figure 18 shows the distribution of 
points in the MIR (WISE W/) vs soft X-ray (0.5-2 keV) plane 
(same as Figure 9), originally introduced in Salvato et al. (2018). 
X-ray bright objects above the dashed line are typically AGN, 
while most of the IR bright objects below the line are Galactic 
X-ray emitting stars, with some contamination from nearby ex- 
tragalactic objects. These sources are rare, but given the size of 
eFEDS, they are a non-negligible number. Thus, when using this 
plot for other surveys one should not forget to account for the 
size of the survey. The larger the surveys the less efficient the 
line separator is. 

The bottom-right panel shows the distribution of the sources 
in the Wise-only W/-W2 vs W2 colour-magnitude plane". This 
is widely used to classify point sources, as it easily separates 
stars, with W/-W2 « 0, from QSOs, with W/-W2 > 0.5, (see 
e.g. Wright et al. 2010b; Assef et al. 2013). Once more, the 
eFEDS X-ray selection reveals the full extent of the extragalactic 
(AGN) population with intermediate IR colours between AGN- 
and host-galaxy dominated, typical of either obscured (Seyfert 2) 
or low-luminosity AGN (e.g., Merloni 2016; Hickox & Alexan- 
der 2018). 

Finally, the bottom-left panel shows the distribution of the 
eFEDS sources in the optical/MIR diagram defined by the "all- 
sky available" G-W/ vs W1-W2, frequently used to separate QSO 
from stars in the Gaia catalog. As already pointed out in Sec- 
tion 6, 10% of the Galaxtic sources are too faint to be detected 
by Gaia. This is even more true for the extragalactic sources: 
the plot shows only 57% of the entire eFEDS sample. However, 
the plot shows insights on the population that the first eROSITA 
All-Sky Survey (eRASSI) will uncover. As expected, the X-ray 
selected eFEDS sources contain, beyond stars and (unobscured) 
QSO, a tail at high G-WI (i.e. bright MIR, faint optical magni- 
tudes) typical of inactive galaxies and/or mildly obscured AGN. 

This is indeed confirmed by comparing the location of the 
extra-galactic eFEDS sources in the grzW/ plane with the X- 
ray hardness ratio measured from the X-ray counts in the bands 
where eROSITA is most sensitive. The right-hand panel of Fig- 
ure 19 shows the distribution of the sources in that plane, colour- 
coded by their average Hardness Ratio (defined as (H-S)/(H+S) 
where H and S are respectively the counts in the ranges 1.0-2.0 
keV and 0.2-1.0 keV ?), while the left-hand panel highlights the 
loci of the most common classes of sources based on the dis- 
tribution of templates tracks. Indeed, the hardest sources in the 
eROSITA band populate the optical/MIR colour-space of Seyfert 
2 galaxies and/or reddened QSO. A detailed discussion of the X- 
ray spectral properties of the AGN in the Main eFEDS sample 
will be presented in Liu et al. (20212). 


9.2. eFEDS stellar content 


The eFEDS field spans a wide range of Galactic latitudes (from 
about +20 to about +40). Reassuringly, the fraction of the X- 


12 For this plot we are using Vega System, so that user can compare the 
figure with similar ones done using AIIWISE all sky 

13 The hardness ratio is calculated using the columns ML CTS bl, 
ML CTS b2 and ML CTS b3 in Brunner et al. 2021 cata- 
log: (ML CTS b3 - (ML CTS bi + ML CTS b2))/(ML CTS b3 + 
(ML CTS bi + ML CTS b2)) 
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Fig. 20: Fraction of all X-ray sources with CTP_quality>2 
which are classified as Galactic as a function of Galactic Lati- 
tude. Purple symbols are for objects classified by the HamStar 
method; red symbols are for all "Secure Galactic" objects and 
orange for both "Secure" and "Likely" Galactic objects (see Sec- 
tion 3 for the definition). 


ray sources which are classified as Galactic (see Section 3 for 
the definition) increases towards low Galactic latitude, as shown 
in Figure 20. The figure also confirms that the priors adopted 
by Hamstar are not representative of all Galactic sources in 
eFEDS. In fact about 22.396 of the Galactic sources identified 
by Nway/ASTROMATCH are fainter than the 19th magnitude (with 
10% Gaia non detected). 


10. Conclusions 


We have presented the identification of the counterparts to the 
point sources in eFEDS listed in the Main and Hard catalogues 
Brunner et al. (2021), together with the study of their multiwave- 
length properties. eFEDS, having a limiting flux of Fo.5-2key ~ 
6.5 x 10° erg s^! cm? is a factor of ~ 50% deeper than the 
final eROSITA all-sky survey, and because of that can be used 
also as a forecast for eRASS:8, not only for the population that 
eRASS:8 will reveal, but also for the challenges that are ahead 
of us with respect to counterpart identification and redshift de- 
termination. 

e Counterpart identification: We have used nway Salvato et al. 
(2019) and astromatcu, each of them using, in addition to spatial 
information from eFEDS and LS8, a prior based on the proper- 
ties of a training sample of 3XMM sources, and tested on a val- 
idation sample of Chandra sources, made eFEDS-like in terms 
of positional accuracy. Each method has identified its own pri- 
ors in a different way. For the validation sample nway, identified 
correctly 95% of the sources with only 2% of sources having 
a second possible counterpart, against 89% and 10% for ASTRO- 
MATCH, but at the threshold adopted for p any and LR. BEST, 
both methods have very high completeness and purity (above 
95%). These remarkable results, well above the predicted com- 
pleteness and purity mentioned in Merloni et al. (2012), are due 
to three important factors: the development of new methods for 
identifying the correct counterparts, large samples of X-ray de- 
tected sources with known counterparts, and the availability of 
sufficiently deep, homogenised, multi-wavelength photometry 
from optical to MIR over very wide areas from which to con- 
struct the SED of these sources to be used as training. In the 
next two years, by the time eROSITA will have completed the 
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final all-sky survey, the methods will continue to improve and 
the training/validation samples will increase in size. Most im- 
portantly, the coverage of the multiwavelength catalogs usable 
for the identification of the counterparts will be larger. In fact, 
while the DESI Legacy Imaging Survey DR9 (LS9; Dey et al. 
2019b), just became publicly available, the work on DRIO has 
started. The survey will cover virtually all of the eROSITA-DE 
area of the sky at sufficient depth, thanks to the inclusion the DE- 
Cam data taken via the DeROSITAS survey (PI A. Zenteno). We 
predict that the identification of the counterparts for the entire 
eRASS will be at least of the same quality of eFEDS, also in the 
Galactic plane, thanks to the inclusion of Gaia EDR3 recently 
released. 

e Redshift determination: Given the lack of sufficiently deep NIR 
data outside the DES area (Sevilla-Noarbe et al. 2021), the possi- 
bility of obtaining reliable photometric redshifts via SED fitting 
will be low, at least until data from SpherEx (Doré et al. 2018) 
will be made available (launch planned for Summer 2024). How- 
ever, as demonstrated in Nishizawa et al (in prep) and Borisov 
et al. (2021), thanks to the increasing size and completeness of 
the spectroscopic sample that can be used for the training, re- 
liable photometric redshifts for any type of X-ray extragalac- 
tic sources are now possible. For example, thanks to the spec- 
troscopic follow-up of the eROSITA-DE sources planned via 
Vista/AMOST and SDSS-V/BHM, we will obtain redshifts for 
80% of the sources detected by eRASS:3, thus limiting the need 
of photo-z and, at the same time, assuring a high quality of 
photo-z that will use these spectroscopically confirmed sources 
as training. 
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Appendix A: Construction of the training, validation 
and field samples 


In the following, we describe the construction of the training, 
validation and associated field samples used for determining the 
different priors adopted by nway and AsTRoMarcu for the deter- 
mination of the counterparts and for assessing the reliability of 
the association, presented in Section 5. 


Appendix A.1: Reference sample selected from the 
3XMM-DR8 serendipitous source catalogue (a.k.a. 
training sample) 


We start with the 3XMM-DR8"" catalogue of X-ray detections, 
and estimate the 0.5-2 keV X-ray flux (and uncertainty) of each 
detection from the 0.5-1, and 1-2 keV band fluxes (and their 
uncertainties). We then select only those detections that meet all 
of the following X-ray quality criteria: 


i. have X-ray flux in the range probed by eFEDS (Fo5-2kev > 
2x 107! erg s^! cm", see figure 9 of Brunner et al. 2021), 
ii. have X-ray positions that have been aligned with the optical 
frame and that have uncertainty smaller than 1.5 arcsec, 
iii. have a signal to noise ratio for F5. »xev that is greater than 
10, 
iv. are consistent with being point-like at the resolution of 
XMM-Newton, 
v. have no close X-ray neighbours within 10 arcsec, 
vi. were not detected at the extreme off-axis angles, 
vii. were detected in XMM-Newton exposures of at least 5 ks, 
. were not labelled by the 3XMM pipeline as being confused, 
affected by high X-ray background or flagged as being prob- 
lematic for any reason. 


We then exclude any X-ray detections that lie in parts of the 
sky that are not representative of a well-chosen extragalactic 
survey field such as eFEDS, or where the optical imaging cat- 
alogue (LS8) is likely to be saturated/unreliable (due to very 
bright stars). Specifically, we exclude any X-ray detections that: 


i. lie near the Galactic plane (|b| < 15 deg), 
ii. lie near the Large or Small Magellanic clouds or M31 
(within 5, 3 and 1 degree radii respectively), 
iii. lie within the disks of bright (Br « 12) well-resolved galax- 
ies from de Vaucouleurs et al. (1991), 
iv. lie closer than 3 arcmin from any very bright star from the 
Yale Bright Star Catalog (Hoffleit 1964), or 
v. lie closer than 3 arcmin from any Tycho-2 (Hgg et al. 2000) 
star having Br « 9 or Vr « 9. 


After applying these criteria, we are left with a sample of 
36276 unique point-like X-ray sources with median positional 
uncertainty 0.57 arcsec. The X-ray flux distribution of the 
3XMM-DRS8-based training sample is broadly similar to that 
of the eFEDS main sample. We find that 92% of the train- 
ing sample have 0.5-2keV fluxes in the range 5x107!5 - 
1x107? erg s! cm™?, and median flux is 1.7x107" erg s^! cm?. 
The equivalent metrics for eFEDS main sample are 91% and 
1.0x107'4 erg s7! cm? respectively. 

We then carried out a positional match of this X-ray train- 
ing sample to the LS8, initially considering optical/IR objects 
that lie within 5arcsec of the X-ray positions. We used Nwav 


14 http://xmmssc.irap.omp.eu/Catalogue/3XMM-DR8/3XMM - 
DR8.html 


(Salvato et al. 2019) to carry out this cross-match, using only as- 
trometric information and number densities (Nway basic mode, 
ie. without any magnitude or colour priors). We retain only 
the X-ray sources with very secure unique optical counterparts. 
Specifically, we require that we consider only X-ray sources with 
>90% probability of having an optical/IR counterpart, and only 
cases where the best optical/IR counterpart is at least O times 
more probable than the next best possibility (p any » 0.9, p. i 
> 0.9). As before, we can afford to be very strict with these cri- 
teria, since we primarily care about purity and not completeness. 
These cuts result in a 3XMM/LSS reference sample of 20705 
high quality X-ray/OIR matches. 

We select a corresponding sample of non-X-ray emitting 
field objects from the LS8, using annular regions (15, 30 arcsec 
radii) around each of the 20 705 reference sample positions. The 
field sample was further filtered to remove any object that lies 
within 15 arcsec of any 3XMM-DRS source. This field sample 
contains just under 396 000 entries. 


Appendix A.2: Reference sample selected from the Chandra 
Source Catalogue v2.0 (a.k.a. validation sample) 


A supplementary X-ray/OIR reference sample was derived from 
the Chandra Source Catalogue v2.0!6. We used the Web API to 
retrieve all CSC2 sources that satisfied the following X-ray qual- 
ity criteria: i) Have Fo oy > 2x 107? erg s^! cm? (estimated 
from the standard CSC2 ‘s’ and ‘m’ bands), ii) have high sig- 
nificance > 6, iii) have a signal to noise ratio on Fo.5-2kev that 
is greater than 5, iv) have X-ray positions with 9596 uncertainty 
ellipse radius smaller than 1.0 arcsec , v) are consistent with be- 
ing point-like at the resolution of Chandra, vi) were detected in 
Chandra exposures of at least 1 ks, vii) were not labelled by the 
CSC pipeline as being confused, affected by readout streaks, or 
piled up. Exactly the same sky region filtering criteria were ap- 
plied to the CSC-based sample as were used to filter the 3XMM- 
based reference sample (see Section A.1). These criteria result 
in a sample of 6066 X-ray sources. 

We followed a similar process as before (Section A.1) to 
match the CSC2 sources to the LS8 catalogue, retaining only 
very secure matches (having p. any > 0.9, p. i > 0.9). This re- 
sults in a CSC-based X-ray/OIR reference sample that contains 
3415 objects. 


Appendix B: Templates used for photo-z 


As was discussed in the main text a key ingredient for comput- 
ing the photometric redshifts is the use of the most appropriate 
templates, able to represent the population under analysis. Here 
we list the templates that have been used for the EXT and PLIKE 
samples, respectively. For each of the templates we provide the 
name of the model, the corresponding number in the catalog and 
the reference to the paper that describes them. 


Appendix B.1: templates used for PLIKE 


The library of templates used for the PLIKE sample includes a 
mixture of SEDs from single objects and hybrids (different rel- 


'S in nway p. any is the probability, for each source in the primary cata- 


log (eFEDS in this case) to have a counterpart in the secondary catalogs; 
then, for each source in the secondary catalogues, p. i gives the proba- 
bility to be the correct counterpart to the source in the primary catalogue 
(see more in the Nwav manual and Salvato et al. (2019) 

16 https://cxc.cfa.harvard.edu/csc2/index.html 
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Fig. A.1: The seven sources for which our counterpart fall within an HECATE galaxy (see Vulic et al. 2021) but is probably pointing 
to a background extragalactic source. The RGB images are 1’x1’. In the cutouts the position of our proposed counterpart, the centre 
of the galaxy, the X-ray position and the positional error are indicated. 


ative contribution of host and nuclear component, as introduced 3. S0 20 QSO2 80 
in Salvato et al. (2009). Templates 1 and 12-17 and 20 are orig- 4. SO 30 QSO2 70 
inally from (Polletta et al. 2007). However, as templates N. 18 5. S0 40 QSO2 60 
and N. 19 (originally from SDSS-V", template N.20 has been 6. S0 50 QSO2 50 
extended in the UV and presented in Salvato et al. (2009). The 7. S0 60 QSO2 40 
same template was then used for creating hybrids templates by 8. SO 70 QSO2 30 
mixing it with normal galaxies templates (Noll et al. 2004) with 9. S0 80 QSO2 20 
different degree of star formation, as presented in the Ananna 10. S0 90 QSO2 10 
et al. (2017)/5. With a similar procedure the hybrid templates 2- 11. CB1_0_LOIII4 
10 where constructed combining an SO and a QSO2 template, 12. Sb 

both from Polletta et al. (2007). The hybrids were originally pre- 13. Spi4 

sented in Salvato et al. (2009) and were successfully used in Sal- 14. M82 

vato et al. (2009, 2011); Marchesi et al. (2016) among others. 15. 122491 

Finally, templates 24-29 are from the recent work of Brown etal. 16. Seyl8 


(2019). 17. Sey2 
18. pl QSOH 

1. SO 19. pl_QSO_DR2_029_10 

2. S0 10. QSO2. 90 20. pl TQSOI 


21. s250. 10 pl TQSOI1 90 
17 http://classic.sdss.org/dr5/algorithms/spectemplates 22. s180 30 pl TQSOI 70 
'8 Note that the templates are slightly different than in Ananna et al. in. 23. s800. 20 pl TQSOI 80 
the UV part 24. 3C120 
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. MRK110 
. NGC5548. 64.00. NGC4138 
. PG0052p251 

. NGC4151 

. NGC3783. 00.50. NGC4725 


Appendix B.2: templates used for EXT 


The templates used for computing the photo-z for the EXT sam- 
ple are almost entirely taken from Brown et al. (2019). The list 
of AGN templates includes SED of single objects (the name of 
the template is self explanatory) and hybrids constructed com- 
bining in different ratio the SED of two different galaxies and 
AGN. Additionally we have added two templates of elliptical 
galaxies from Polletta et al. (2007) and two newly created tem- 
plates of type 1 AGN. We have used the Type 1 archetype pre- 
sented in Comparat et al. (2020) and obtained by the stacking of 
all type 1 sources that are counterparts to ROSAT/2RXS (Boller 
et al. 2016; Salvato et al. 2019) that had an SDSS spectra. The 
SED was then extended in the MIR using the BQSO template 
presented in Polletta et al. (2007). BQSO is similar to TQSO but 
with less enhanced MIR flux. This new hybrid was then extended 
in the UV with various slopes. 


. 2MASXJ130005334+1632151 

. Ark564 

. F16156+0146 

. F2M11134+1244 

. Fairall9 

. H1821+643 

. IRAS. 1111943257 

. Mrk110 

. Mrk1502 

. Mrk231 

. Mrk290 

. Mrk493 

. Mrk590 

. Mrk817 

. NGC3227 Central 00.50 NGC4569 
. NGC3227 Central 01.00 NGC4569 
. NGC3227 Central 02.00 NGC4569 
. NGC3227. Central 04.00 NGC4569 
. NGC3227 Central 08.00 NGC4569 
. NGC3227 Central 16.00 NGC4569 
. NGC3227 Central 32.00 NGC4569 
. NGC3227 Central. 64.00 NGC4569 
. NGC3516 Central, 00.50 NGC4826 
. NGC3516 Central 01.00 NGC4826 
. NGC3516 Central. 02.00 NGC4826 
. NGC3516 Central, 04.00 NGC4826 
. NGC3516 Central, 08.00 NGC4826 
. NGC3516 Central. 16.00 NGC4826 
. NGC3516 Central. 32.00 NGC4826 
. NGC3516 Central. 64.00 NGC4826 
. NGC3516 Central 

. NGC3783 Central 01.00 NGC4725 
. NGC3783 . Central 02.00 NGC4725 
. NGC3783 . Central 04.00 NGC4725 
. NGC3783 Central, 08.00 NGC4725 
. NGC3783 . Central 16.00 NGC4725 
. NGC3783 Central 32.00 NGC4725 
. NGC3783 Central. 64.00 NGC4725 
. NGCA4051 Central. 00.50. NGC3310 
. NGC4051 Central. 00.50 NGC4125 
41. NGC4051 . Central 00.50 NGC4559 


wo o00-10 tA ULP. -— 


. NGC4051. Central 01.00 NGC3310 
. NGC4051 Central 01.00 NGC4125 
. NGC4051 Central 01.00 NGC4559 
. NGC4051 Central 02.00 NGC3310 
. NGC4051 Central 02.00 NGC4125 
. NGC4051 Central 02.00 NGC4559 
. NGC4051 Central 04.00 NGC3310 
. NGC4051 Central 04.00 NGC4125 
. NGC4051. Central. 04.00 NGC4559 
. NGC4051. Central. 08.00 NGC3310 
. NGC4051 Central 08.00 NGC4125 
. NGC4051. Central. 08.00 NGC4559 
. NGC4051. Central. 16.00 NGC3310 
. NGC4051 Central 16.00 NGC4125 
. NGC4051 Central 16.00 NGC4559 
. NGC4051. Central 32.00 NGC3310 
. NGC4051 Central 32.00 NGC4125 
. NGC4051 Central 32.00 NGC4559 
. NGC4051 Central 64.00 NGC3310 
. NGC4051 Central 64.00 NGC4125 
. NGC4051 Central 64.00 NGC4559 
. NGC4051_ Central 

. NGCA151 . Central 00.50 NGC4125 
. NGCA4151 Central 00.50 NGC4579 
. NGCA151 Central 01.00 NGC3310 
. NGCA151 Central 01.00 NGC4125 
. NGCA151 Central 01.00 NGC4579 
. NGCA151 Central 02.00 NGC3310 
. NGCA4151 Central 02.00 NGC4125 
. NGCA151 Central 02.00 NGC4579 
. NGCA151. Central 04.00 NGC3310 
. NGCA151 Central 04.00 NGC4125 
. NGCA151 Central 04.00 NGC4579 
. NGCA151 . Central 08.00 NGC3310 
. NGCA151 Central 08.00 NGC4125 
. NGCA151 Central 08.00 NGC4579 
. NGCA151 Central 16.00 NGC3310 
. NGCA151 Central 16.00 NGC4125 
. NGCA151 Central 16.00 NGC4579 
. NGCA151 . Central 32.00 NGC3310 
. NGCA151 Central 32.00 NGC4125 
. NGCA151 Central 32.00 NGC4579 
. NGCA151 Central 64.00 NGC3310 
. NGCA151 Central 64.00 NGC4125 
. NGCA151 Central 64.00 NGC4579 
. NGC5548 Central, 00.50 NGC4138 
. NGC5548 Central 01.00 NGC4138 
. NGC5548 Central 02.00 NGC4138 
. NGC5548 Central 04.00 NGC4138 
. NGC5548 Central, 08.00 NGC4138 
. NGC5548 Central 16.00 NGC4138 
. NGC5548 Central 32.00 NGC4138 
. NGC5548 Central. 64.00 NGC4138 
. NGC5548 Central 

. NGC5728 

. NGC7469 

. OQ 530 

. PG00264129 

. PGI4154451 

. PKS1345+12 

. Ton951 

. Ell4_ A 0 

. El5 A 0 

. pl BQSO ColI9 sl-8 

. pl BQSO Col9 sl-20 
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Fig. C.1: Example of SED fitting and redshift probability distri- 
bution (in the inset). The photometric points and their errors are 
indicated with black points. The best extragalactic and Galactic 
templates are shown with a solid line. The plot and the data for 
creating the plots are available under request. 


Appendix C: Release of PDZ and SED fitting 


For each primary (and secondary, in case it exists) counterpart 
to the eFEDS point sources we make available under request the 
redshift probability distribution and the SED fitting as in Fig- 
ure C.1. 


Appendix D: Columns description of released 
catalogs 


Most columns are common to both the Main and Hard sam- 
ple. We list column descriptions for the Main sample. In the 
few cases where there are differences in the column descriptions, 
we report in brackets the corresponding description for the Hard 
sample. The last column of the catalog (ID MAIN) is present 
only in the Hard catalog. 


1. ERO Name: eROSITA official source Name (see Brunner 
et al.) 

2. ERO ID: ID of eROSITA source in the Main Sample (from 
Brunner et al.) [ID of eROSITA source in the Hard sample 
(from Brunner et al.)] 

3. ERO RA CORR; J2000 Right Ascension of the eROSITA 
source (corrected) in degrees in the Main Sample (from 
Brunner et al.) [J2000 Right Ascension of the eROSITA 
source (corrected) in degrees in the Hard Sample (from 
Brunner et al.)] 

4. ERO Dec CORR:2000 Declination of the eROSITA 
source (corrected) in degrees in the Main Sample (from 
Brunner et al.) [J2000 Declination of the eROSITA source 
(corrected) in degrees in the Hard Sample (from Brunner et 
al.)] 

5. ERO RADEC ERR CORR: eROSITA positional uncer- 
tainty (corrected) in arcsec from the Main Sample (from 
Brunner et al.) [eROSITA positional uncertainty (corrected) 
in arcsec from the Hard Sample (from Brunner et al.)] 

6. ERO ML FLUX: 0.2-2.3 keV source flux in erg cm ? s^, 
converted from count rate assuming ECF=1.074e+12 
(Gamma=2.0). [In the Hard catalog the column is called 
ERO_ML_FLUX_3: 2.3-5 keV source flux in erg cm? s^, 
converted from count rate assuming ECF=1.147e+11 
(Gamma=2.0)]. See Brunner et al. 
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7. ERO_ML_FLUX_ERR: 0.2-2.3 keV source flux error (1 
sigma) in erg cm? s^!. [In the Hard catalog the column 
is called ERO ML FLUX ERR 3: 2.3-5 keV error (1 
sigma) in erg cm ? s^! ]. See Brunner et al. 

8. ERO DET LIKE: 0.2-2.3 keV detection likelihood mea- 
sured by PSF-fitting. [In the Hard catalog the column is 
called ERO DET LIKE 3: 2.3-5 keV detection likelihood 
measured by PSF-fitting]. See Brunner et al. 

9. ERO inArea90: true if in the 0.2-2.3keV exp»500s region, 
which comprises 9046 area (from Brunner et al.). 

10. CTP LS8 UNIQUE OBJID: LS8 unique identifier for the 
proposed counterpart to the eROSITA source (Expression: 
toString(LS8 BRICKID)-" "-toString(LS8 OBJID)) 

11. CTP LS8 RA: J2000 Right Ascension of the LS8 counter- 
part in degrees 

12. CTP LSS$ Dec: J2000 Declination of the LS8 counterpart 
in degrees 

13. Dist CTP LS8 ERO: Separation between selected coun- 
terpart and eROSITA (corrected) position in arcsec. 

14. CTP NWAY LS8 UNIQUE OBJID: LS8 unique identi- 
fier of the best LS8 countepart from NWAY (Expression: 
toString(LS8_BRICKID)+"_"+toString(LS8_OBJID)) 

15. CTP_NWAY_LS8_RA: J2000 Right Ascension of the LS8 
counterpart from NWAY in degrees. 

16. CTP_NWAY_LS8_Dec: J2000 Declination of the LS8 
counterpart from NWAY in degrees. 

17. CTP_NWAY dist bayesfactor: Logarithm of ratio be- 
tween prior and posterior, from separation, positional error 
and number density (see Appx. in Salvato et al 2018 for clar- 
ifications) 

18. CTP NWAY dist post: Distance posterior probability 
comparing this association vs. no association (see Appx. in 
Salvato et al 2018 for clarifications) 

19. CTP NWAY p single: Same as dist. post, but weighted by 

the prior (see Appx. in Salvato et al 2018 for clarifications) 

. CTP. NWAY p any: For each entry in the X-ray catalogue, 
the probability that there is a counterpart (see Appx. in Sal- 
vato et al 2018 for clarifications) 

21. CTP NWAY p i: Relative probability of the eROSITA/LS8 
match (see Appx. in Salvato et al 2018 for clarifications) 

. Dist NWAY LS8 ERO: Separation between the eROSITA 
position and the LS8 counterpart from NWAY in arcsec 

. CTP MLR LS8 UNIQUE OBJID: LS8 unique identi- 
fier of the LS8 counterpart from AsrRoMarCH (Expression: 
toString(LS8 BRICKID)-" "-toString(LS8 OBJID)) 

. CIP MLR LSS8 RA : J2000 Right Ascension of LS8 
counterpart from ASTROMATCH in degrees 

. CTP MLR LSS Dec: J2000 Declination of LS8 counter- 
part from ASTROMATCH in degrees 

. CIP MLR LR BEST: Likelihood Ratio value from as- 
TROMATCH 

. CTP MLR REL BEST: Reliability of the identification 
from ASTROMATCH 

. Dist MLR LS8 ERO: Separation between the eROSITA 

position and the LS8 counterpart from AsTROMATCH in arcsec 

CTP SAME: Comparison NWAY/MLR: true if the counter- 

part selected by the two method is the same 

. CTP MLR: Comparison NWAY/MLR: true if the coun- 
terpart from NWAY(MLR) has p any(LR BEST) be- 
low(above) threshold 

31. CTP Hamstar: Match to Hamstar: 1-same counterpart, 

O-different counterpart, -99=no Hamstar (Schneider et al) 

CTP Hamstar p stellar: Probability of association from 

Hamstar. 


29. 


32. 


. GaiaEDR3 phot rp mean mag: 


Salvato et al.: Point-like sources in eFEDS 


. Dist CTP Hamstar: Separation between the counterpart 


proposed by Hamstar and the counterpart selected in this 
work. 


. CTP quality: counterpart quality: 4=best, 3=good, 2=with 


secondary, 1/0=unreliable (see paper) 


. GaiaEDR3 ID: ID in Gaia EDR3 source catalog 
. GaiaEDR3 parallax: Parallax from Gaia EDR3 in mas 
. GaiaEDR3 parallax error: 


Parallax error from Gaia 
EDR3 in mas 


. GaiaEDR3_parallax_over_error: Parallax/Parallax error. 


(a ratio >5 define a SECURE GALACTIC counterpart) 


. GaiaEDR3_pmra: Proper motion in RA from Gaia EDR3 


. GaiaEDR3_pmra_error: Error on Proper motion in RA 
from Gaia EDR3 

. GaiaEDR3_pmdec: Proper motion in Dec from Gaia EDR3 

. GaiaEDR3_pmdec_error: Error on Proper motion in Dec 
from Gaia EDR3 

. GaiaEDR3 phot g mean mag: g band magnitude 
(VEGA) from Gaia EDR3 


. GaiaEDR3 phot g mean mag error: Error g band mag- 


nitude (VEGA) from Gaia EDR3 


. GaiaEDR3 phot bp mean mag: bp band magnitude 


(VEGA) from Gaia EDR3 


. GaiaEDR3 phot bp mean mag error: Error bp band 


magnitude (VEGA) from Gaia EDR3 


rp band magnitude 
(VEGA) from Gaia EDR3 


. GaiaEDR3 phot rp mean mag error: Error rp band 


magnitude (VEGA) from Gaia EDR3 


. FUV: Galex Far UV magnitude (AB magnitude) 

. FUV ERR: Galex Far UV magnitude error (AB magnitude) 
. NUV: Galex Near UV magnitude (AB magnitude) 

. NUV ERR: Galex Near UV magnitude error (AB magni- 


tude) 


. KiDS u: KiDS u-band magnitude (AB magnitude) 
. KiDS u ERR: KiDS u-band magnitude error (AB magni- 


tude) 


. KiDS g: KiDS g-band magnitude (AB magnitude) 
. KiDS g ERR: KiDS g-band magnitude error (AB magni- 


tude) 


. KiDS r: KiDS r-band magnitude (AB magnitude) 
. KiDS r ERR: KiDS r-band magnitude error (AB magni- 


tude) 


. KiDS i: KiDS i-band magnitude (AB magnitude) 
. KiDS i ERR: KiDS i-band magnitude error (AB magni- 


tude) 


. omegac z: OmegaCAM z-band magnitude (AB magnitude) 
. omegac z ERR: OmegaCAM z-band magnitude error (AB 


magnitude) 


. HSC g: HSC g-band magnitude (AB magnitude) 
. HSC g ERR: HSC g-band magnitude error (AB magni- 


tude) 


. HSC r: HSC r-band magnitude (AB magnitude) 

. HSC r ERR: HSC r-band magnitude error (AB magnitude) 
. HSC r2: HSC r2-band magnitude (AB magnitude) 

. HSC r2 ERR: HSC 12-band magnitude error (AB magni- 


tude) 


. HSC i: HSC i-band magnitude (AB magnitude) 

. HSC i ERR: HSC i-band magnitude error (AB magnitude) 
. HSC i2: HSC i2-band magnitude (AB magnitude) 

. HSC i2 ERR: HSC i2-band magnitude error (AB magni- 


tude) 


. HSC z: HSC z-band magnitude (AB magnitude) 
. HSC z ERR: HSC z-band magnitude error (AB magni- 


tude) 


75. 
76. 


77. 


78. 


79. 


80. 


81. 


82. 


83. 


84. 


85. 


86. 


115. 


HSC_Y: HSC Y-band magnitude (AB magnitude) 
HSC_Y_ERR: HSC Y-band magnitude error (AB magni- 
tude) 

VIKING z: VISTA/VIKING z-band magnitude (AB mag- 
nitude) 
VIKING_z_ERR: VISTA/VIKING z-band magnitude error 
(AB magnitude) 

VIKING_Y: VISTA/VIKING Y-band magnitude (AB mag- 
nitude) 
VIKING_Y_ERR: VISTA/VIKING Y-band magnitude er- 
ror (AB magnitude) 

VIKING_J: VISTA/VIKING J-band magnitude (AB mag- 
nitude) 
VIKING_J_ERR: VISTA/VIKING J-band magnitude error 
(AB magnitude) 

VIKING_H: VISTA/VIKING H-band magnitude (AB mag- 
nitude) 
VIKING H ERR: VISTA/VIKING H-band magnitude er- 
ror (AB magnitude) 

VIKING Ks: VISTA/VIKING Ks-band magnitude (AB 
magnitude) 

VIKING Ks ERR: VISTA/VIKING Ks-band magnitude 
error (AB magnitude) 


. WI: LS8/Wise W1 magnitude (AB magnitude) 

. W1 ERR: LS8/Wise W1 magnitude error (AB magnitude) 
. W2: LS8/Wise W2 magnitude (AB magnitude) 

. W2 ERR: LS8/Wise W2 magnitude error (AB magnitude) 
. W3: LS8/Wise W3 magnitude (AB magnitude) 

. W3 ERR: LS8/Wise W3 magnitude error (AB magnitude) 
. W4: LS8/Wise W4 magnitude (AB magnitude) 

. W4 ERR: LS8/Wise W4 magnitude error (AB magnitude) 
. LS8 g: LS8 g-band magnitude (AB magnitude) 

. LS8 g ERR: LS8 g-band magnitude error (AB magnitude) 
. LS8 r: LS8 r-band magnitude (AB magnitude) 

. LS8 r ERR: LS8 r-band magnitude error (AB magnitude) 
. LS8 z: LS8 z-band magnitude (AB magnitude) 

. LS8 z ERR: LS8 z-band magnitude error (AB magnitude) 
. VHS Y: VISTA/VHS Y-band magnitude (AB magnitude) 

. VHS Y ERR: VISTA/VHS Y-band magnitude error (AB 


magnitude) 


. VHS H: VISTA/VHS H-band magnitude (AB magnitude) 
. VHS H ERR: VISTA/VHS H-band magnitude error (AB 


magnitude) 


. VHS Ks: VISTA/VHS Ks-band magnitude (AB magnitude) 
. VHS Ks ERR: VISTA/VHS Ks-band magnitude error (AB 


magnitude) 


. HCS_g diff: Difference between psf and Kron magnitude in 


HSC g-band (AB magnitude) 


. HCS_r_diff: Difference between psf and Kron magnitude in 


HSC r-band (AB magnitude) 


. HCS_i_diff: Difference between psf and Kron magnitude in 


HSC i-band (AB magnitude) 


. HCS_z_diff: Difference between psf and Kron magnitude in 


HSC z-band (AB magnitude) 


. HCS_opt_extended: Extension in HSC griz bands. 1=ex- 


tended; -99=data missing O0=other from Aihara et al 2018 


. CTP_LS8_phot_flag: Flag for LS8 photometry: true when 


the source has simultaneously g,r,z,w1 photometry in LS8 


. CTP LSS Type: Morphological model from LS8 
. in KiDS flag: Flag for KiDS coverage: 1: Source is in KiDS 


area; 0: otherwise 
in HSC flag: Flag for HSC coverage: 1: Source is in HSC 
area as from Aihara et al 2018; 0: otherwise 
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116. SPECZ RA: Right Ascension (degrees) of the spectro- 
scopic redshift entry in the original catalogue from which 
it was taken 

117. SPECZ Dec: Declination (degrees) of the spectroscopic 
redshift entry in the original catalogue from which it was 
taken 

118. SPECZ Redshift: Spectroscopic redshift from original cat- 
alo 

119. SPÉCZ NORMQ: Normalised quality of spectroscopic 
redshift: 3=secure, 2=not secure, 1—unreliable redshift/bad 
spectrum, -1= Blazar candidate 

120. SPECZ_Origin: Catalogue which provided this spectro- 
scopic redshift 

121. SPECZ_Original_ID: Identifier of this spectroscopic red- 


shift entry in the original catalogue from which it was taken 
122. SPEC Gal flag : true when the CTP has a reliable redshift 


above 0.002 (boolean) 

123. SPEC Star flag : true when the CTP has a reliable redshift 
below 0.002 (boolean) 

124. CTP Classification : counterpart classification: SE- 


CURE/LIKELY GALACTIC/EXTRAGALACTI, as from 
flowchart (see paper) 

125. PHZ LePHARE zphot : Photo-z from Le PHARE, but set 
to 0 for GALACTIC sources 

126. PHZ LePHARE 168 : Le PHARE zphot min at 1 sigma 

127. PHZ LeePHARE zu68 : Le PHARE zphot max at 1 sigma 

128. PHZ LePHARE 190 : Le PHARE zphot min at 2 sigma 

129. PHZ LePHARE zu90 : Le PHARE zphot max at 2 sigma 

130. PHZ LePHARE 199 : Le PHARE zphot min at 3 sigma 

131. PHZ LePHARE zu99 : Le PHARE zphot max at 3 sigma 

132. PHZ LePHARE chi: Le PHARE chi? value for best fitting 
galaxy/AGN template 

133. PHZ LePHARE ModelAGN : Le PHARE best template 
fitting the data 

134. PHZ LePHARE extlaw : Le PHARE Extinction Law ap- 
plied to the template: Prevot (1) or none (0) 

135. PHZ LePHARE ebv : Le PHARE E(B-V) applied to the 
template 

136. PHZ LePHARE pdz : Le Phare probability distribution. 
Photo-z more reliable when value is high 

137. PHZ LePHARE nband : Le Phare number of bands used 
for the computation of photo-z 

138. PHZ LePHARE zp2 : Le Phare second best photo-z from 
LEPHARE, if existing 

139. PHZ LePHARE chi2 2 : Le Phare chi2 value for second 
best fitting template, if existing 

140. PHZ LePHARE ModelAGN 2 : Le Phare second best 
template fitting the data, if existing 

141. PHZ LePHARE p4dz2 : Le Phare probability distribution 
for secondary solution, if existing 

142. PHZ DNNz zphot: Photo-z from DNNZ (from Nishizawa 
et al.), but set to 0 for GALACTIC sources 

143. PHZ DNNz 7168 : DNNZ 1sigma min error on photo-z 

144. PHZ DNNz zu68 : DNNZ 1 sigma max error on photo-z 

145. PHZ DNNz 7195 : DNNZ 2 sigma min error on photo-z 

146. PHZ DNNz zu95 : DNNZ 2 sigma max error on photo-z 

147. CTP. REDSHIFT : Final redshift: zspec (NORMQ=3) 
when available, else photo-z from LE PHARE 

148. CTP. REDSHIFT GRADE : In a range from 5 (spec- 
troscopy) to 0 (unreliable photo-z) (see text for details) 

149. CLUSTER CLASS: In range from 5 to 1: 5=most likely a 
cluster; 1= not a cluster (see text for details) 

150. CTP CLASS : same as CTP Classification, but with num- 
bers: 3: SECURE EXTRAGALACTIC; 2: LIKELY EX- 
TRAGALACTIC; 1: SECURE GALACTIC; 0: LIKELY 
GALACTIC 

151. ID MAIN: [column present only in the Hard sample: The 
source ID in the Main catalog for the sources in common.] 
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